
WO 2005/057207 PCT/GB2004/005140


TRITYL DERIVATIVES FOR ENHANCING MASS SPECTROMETRY

All documents cited herein are incorporated by reference in their entirety. 
TECHNICAL FIELD 

This invention relates to derivatised biopolymers and ions obtainable therefrom. The invention 
further relates to compounds and solid supports useful for producing the derivatised biopolymers and
ions of the invention. 

BACKGROUND OF THE INVENTION 

Mass spectrometry is a versatile analytical technique possessing excellent detection range and speed 
of detection with respect to High Performance Liquid Chromatography (HPLC), Gas 
Chromatography (GC), Infra-Red (IR) and Nuclear Magnetic Resonance (NMR).

However, many biopolymers, such as carbohydrates and proteins, are difficult to analyse using mass
spectrometry due to significant difficulties in ionising the biopolymer, even using Matrix Assisted
Laser Desorption/Ionisation Time Of Flight (MALDI-TOF) techniques. Despite the considerable
resolving power of 2D-PAGE, this technology has fallen far short of the ultimate goal of displaying
the whole proteome in a single experiment, as many proteins are resistant to 2D-PAGE analysis (e.g.
those with low or high molecular masses, membrane proteins, proteins with extreme isoelectric
points, etc.). Many proteins are thus invisible to 2D-PAGE [Cravatt & Sorensen (2000) Current
Opinion in Chemical Biology vol. 4, p. 663-668].

There is thus a need for improvements in mass spectrometry analysis of biopolymers. 

DISCLOSURE OF THE INVENTION

It has now been found that covalent attachment of trityl derivatives to biopolymers can improve the 
ionisation properties of the biopolymer. The ions (formula (I) below) formed by ionisation of the 
derivatised biopolymers are particularly suitable for mass spectrometry analysis, and biopolymers 
derivatised as specified in formulae (IIIa) and (IIIb) below can be readily ionised.

Whereas triphenylmethyl derivatives covalently attached to certain biopolymers (e.g. DNA) are
known in the prior art [e.g. Chem. Soc. Rev. (2003) 32, p. 3-13], the prior art attaches the polymer to
the α-triphenylmethyl carbon atom through a non-aromatic linker. In contrast, under the present
invention the biopolymer is attached to the α-triarylmethyl carbon atom via an aromatic group
adjacent to the central carbon atom. Consequently, ionisation of the prior art derivatives results in
separation of the triphenylmethyl derivative and the biopolymer, whereas according to the present
invention the biopolymer remains bound to the trityl derivative on ionisation, thereby allowing
analysis of the biopolymer by mass spectrometry.

The invention provides methods of forming ions from covalent or ionic compounds and solid
substrates.

Derivatised Biopolymers

The invention provides a method of forming an ion of formula (I):

(Ar²)_n-C*-[Ar¹-(LM{M'-Bp'}_p)_q]_m    (I)

comprising the steps of:
(i) reacting a compound of the formula (IIa):

(Ar²)_n-C(X)-[Ar¹-(LM{M}_p)_q]_m    (IIa);

with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent
linkage, to provide a biopolymer derivative of the formula (IIIa):

(Ar²)_n-C(X)-[Ar¹-(LM{M'-Bp'}_p)_q]_m    (IIIa); and

(ii) cleaving the C-X bond between X and the α-carbon atom of the derivative of
formula (IIIa) to form the ion of formula (I);

where:

C* is a carbon atom bearing a single positive charge or a single negative charge;

X is a group capable of being cleaved from the α-carbon atom to form an ion of formula (I);

M is independently a group capable of reacting with Bp to form the covalent linkage;

Bp' is independently the biopolymer residue of Bp produced on formation of the covalent
linkage;

M' is independently the residue of M produced on formation of the covalent linkage;

Ar¹ is independently an aromatic group or an aromatic group substituted with one or more A;

Ar² is independently an aromatic group or an aromatic group substituted with one or more A;

optionally wherein (a) two or three of the groups Ar¹ and Ar² are linked together by
one or more linker groups L, where each L is independently a single bond or a linker atom or
group; and/or (b) two or three of the groups Ar¹ and Ar² together form an aromatic group or an
aromatic group substituted with one or more A;

A is independently a substituent;

LM is independently a single bond or a linker atom or group;

n = 0, 1 or 2 and m = 1, 2, or 3, provided the sum of n+m = 3;

p independently = 1 or more; and

q independently = 1 or more.
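
The constraints on n, m, p and q stated above can be checked mechanically. The following is a minimal Python sketch, not part of the specification: the class name `TritylIndices` and the method `attachment_sites` are hypothetical, and the sketch assumes for simplicity that every arm shares the same p and q, although the text allows them to vary independently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TritylIndices:
    """Indices of formula (I); p and q are assumed uniform across all arms."""

    n: int  # number of Ar2 groups
    m: int  # number of Ar1 groups carrying linker arms
    p: int  # groups M (or M'-Bp') per linker LM
    q: int  # linker arms per Ar1 group

    def validate(self) -> None:
        """Raise ValueError if the indices break the stated constraints."""
        if self.n not in (0, 1, 2):
            raise ValueError("n must be 0, 1 or 2")
        if self.m not in (1, 2, 3):
            raise ValueError("m must be 1, 2 or 3")
        if self.n + self.m != 3:
            raise ValueError("n + m must equal 3")
        if self.p < 1 or self.q < 1:
            raise ValueError("p and q must each be 1 or more")

    def attachment_sites(self) -> int:
        """Total number of M groups under the uniform p and q assumption."""
        self.validate()
        return self.m * self.q * self.p
```

For instance, `TritylIndices(n=2, m=1, p=1, q=1)` describes a derivative with a single attachment site, whereas `TritylIndices(n=2, m=2, p=1, q=1)` is rejected because n + m is not 3.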
The invention further provides a method of forming an ion of formula (I), comprising the steps of:
(i) reacting a compound of the formula (IIb):

(Ar²)_n-C*-[Ar¹-(LM{M}_p)_q]_m  X*    (IIb);

with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent
linkage, to provide a biopolymer derivative of the formula (IIIb):

(Ar²)_n-C*-[Ar¹-(LM{M'-Bp'}_p)_q]_m  X*    (IIIb); and

(ii) dissociating X* from the derivative of formula (IIIb), to form the ion of formula (I);
where:

X* is a counter-ion to C*;

and C*, M, Bp', M', Ar¹, Ar², LM, n, m, p and q are as defined above.

The compounds of formulae (IIa) or (IIb) may optionally be purified after step (i).

The invention also provides biopolymer derivatives of the formula (IIIa) or (IIIb), as defined above.
The biopolymer derivatives of the invention have enhanced ionisability with respect to free
biopolymer, Bp. Advantageously, the biopolymer derivatives may not require a matrix (e.g. as used
in MALDI-MS) in order to elicit ionisation, although a matrix may help to enhance ionisation.
Preferably, ionisation may be obtained without requiring acid treatment, in particular by direct laser
illumination.

The invention also provides ions of formula (I), as defined above. These ions are stabilised by the
resonance effect of the aromatic groups Ar¹ and Ar². Electron-withdrawing groups, when C* is an
anion, or electron-donating groups, when C* is a cation, may optionally be provided on Ar¹ and/or
Ar² to assist this resonance effect. Consequently, the biopolymer derivatives of the invention readily
form ions of formula (I) relative to the native biopolymer, Bp.

The ions of formula (I) are generally only ever seen on a mass spectrum with a single charge, which
is advantageous since it reduces cluttering of the mass spectrum.

The invention also provides compounds of the formula (IIa) and (IIb), as defined above. As
mentioned above, these compounds are useful for forming ions of formula (I). As the difference in
the molecular mass of the ions of formula (I) and that of the free biopolymer can be accurately
calculated, the derivatised compounds of the invention allow analysis of the biopolymer Bp, which
may be otherwise difficult or impossible to analyse using known mass spectrometrical techniques.

Other advantageous features of the compounds of the invention include more uniformity of the signal
intensity between different analytes (useful for quantitative studies) and similar desorption properties
between compounds with different, but close, masses, so that techniques such as isotope coded
affinity tagging (ICAT) can be employed with the compounds of the invention.
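
As noted above, the mass of the ion of formula (I) differs from that of the free biopolymer by an amount that can be calculated. The Python sketch below illustrates that bookkeeping under stated assumptions: a single biopolymer residue Bp' per ion, a known mass for the trityl portion of the ion (everything except Bp'), and a known mass for whatever atoms Bp loses when the covalent linkage forms. The function name and the numbers in the usage note are hypothetical, and the electron mass is neglected.

```python
def biopolymer_mass(ion_mz: float,
                    tag_fragment_mass: float,
                    mass_lost_on_coupling: float = 0.0,
                    charge: int = 1) -> float:
    """Estimate the mass of free Bp from the m/z of an ion of formula (I).

    ion_mz                 observed m/z of the ion
    tag_fragment_mass      mass of the ion minus the residue Bp' (assumed known)
    mass_lost_on_coupling  mass of atoms lost by Bp on forming Bp' (assumed known)
    charge                 ion charge; the text states these ions are normally
                           observed with a single charge
    """
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    # Mass of the biopolymer residue Bp' carried by the ion.
    residue_mass = ion_mz * charge - tag_fragment_mass
    # Add back whatever Bp lost when the covalent linkage was formed.
    return residue_mass + mass_lost_on_coupling
```

With purely illustrative inputs, `biopolymer_mass(1850.90, 350.15, 1.008)` subtracts a 350.15 Da tag portion from a singly charged ion at m/z 1850.90 and adds back one hydrogen.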

The homogeneous methods of the invention are particularly appropriate for small molecules, e.g.
amines.

Solid Supports

The ions of formula (I) may also be formed using a derivatised solid support.
The invention therefore provides a method of forming an ion of formula (I) comprising the steps of:
(i) reacting a solid support of formula (IVai), (IVaii), or (IVaiii):

(Ar²)_n-C(---Ss)-[Ar¹-(LM{M}_p)_q]_m    (IVai);

(Ar²)_n-C(X)(-Ar¹(---Ss)-(LM{M}_p)_q)-[Ar¹-(LM{M}_p)_q]_(m-1)    (IVaii);

(Ar²)_(n-1)-C(X)(-Ar²---Ss)-[Ar¹-(LM{M}_p)_q]_m    (IVaiii);

with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent
linkage, to provide a modified solid support of the formula (Vai), (Vaii), or (Vaiii), respectively:

(Ar²)_n-C(---Ss)-[Ar¹-(LM{M'-Bp'}_p)_q]_m    (Vai);

(Ar²)_n-C(X)(-Ar¹(---Ss)-(LM{M'-Bp'}_p)_q)-[Ar¹-(LM{M'-Bp'}_p)_q]_(m-1)    (Vaii);

(Ar²)_(n-1)-C(X)(-Ar²---Ss)-[Ar¹-(LM{M'-Bp'}_p)_q]_m    (Vaiii);



and either:

(iia) for modified solid supports of formula (Vai), cleaving the C---Ss bond between
the α-carbon atom of the modified solid support of formula (Vai) and the solid support Ss to form the
ion of formula (I);

(iib) for modified solid supports of formula (Vaii), either simultaneously or
sequentially, cleaving the C-X bond between X and the α-carbon atom and cleaving the Ss---Ar¹
bond between the solid support and the Ar¹ group to form the ion of formula (I); or

(iic) for modified solid supports of formula (Vaiii), either simultaneously or
sequentially, cleaving the C-X bond between X and the α-carbon atom and cleaving the Ss---Ar²
bond between the solid support and the Ar² group to form the ion of formula (I);
where:

X, Ar¹, Ar², Bp', LM, M, M', n, m, p and q are as defined above;
Ss is a solid support;
C---Ss comprises a cleavable bond between C and Ss;
Ss---Ar¹ comprises a cleavable bond between Ar¹ and Ss; and
Ss---Ar² comprises a cleavable bond between Ar² and Ss.

The cleavable bond of C---Ss, Ss---Ar¹ or Ss---Ar² may be a covalent, ionic, hydrogen, dipole-dipole
or van der Waals bond.

The invention further provides a method of forming an ion of formula (I) comprising the steps of:



(i) reacting a solid support of formula (IVbii) or (IVbiii):

(Ar²)_n-C*(-Ar¹(---Ss)-(LM{M}_p)_q)-[Ar¹-(LM{M}_p)_q]_(m-1)  X*    (IVbii);

(Ar²)_(n-1)-C*(-Ar²---Ss)-[Ar¹-(LM{M}_p)_q]_m  X*    (IVbiii);

with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent
linkage, to provide a modified solid support of the formula (Vbii) or (Vbiii), respectively:

(Ar²)_n-C*(-Ar¹(---Ss)-(LM{M'-Bp'}_p)_q)-[Ar¹-(LM{M'-Bp'}_p)_q]_(m-1)  X*    (Vbii);

(Ar²)_(n-1)-C*(-Ar²---Ss)-[Ar¹-(LM{M'-Bp'}_p)_q]_m  X*    (Vbiii);

and either:

(iia) for modified solid supports of formula (Vbii), either simultaneously or
sequentially, dissociating X* from the derivative of formula (Vbii) and cleaving the Ss---Ar¹ bond
between the solid support and the Ar¹ group to form an ion of formula (I); or

(iib) for modified solid supports of formula (Vbiii), either simultaneously or
sequentially, dissociating X* from the derivative of formula (Vbiii) and cleaving the Ss---Ar² bond
between the solid support and the Ar² group to form an ion of formula (I);

where: X*, Ar¹, Ar², Bp', LM, M, M', n, m, p, q, Ss, C---Ss, Ss---Ar¹ and Ss---Ar² are as defined
above.

The invention further provides a method of forming an ion of formula (I) comprising the steps of:
(i) reacting a solid support of formula (IVaiv) or (IVbiv):

(Ar²)_n-C(X)(-Ar¹(-LM{M}_(p-1){M''---Ss})(-LM{M}_p)_(q-1))-[Ar¹-(LM{M}_p)_q]_(m-1)    (IVaiv);

(Ar²)_n-C*(-Ar¹(-LM{M}_(p-1){M''---Ss})(-LM{M}_p)_(q-1))-[Ar¹-(LM{M}_p)_q]_(m-1)  X*    (IVbiv);

with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent
linkage, to provide a modified solid support of the formula (Vaiv) or (Vbiv), respectively:

(Ar²)_n-C(X)(-Ar¹(-LM{M'-Bp'}_(p-1){M'-Bp'})(-LM{M'-Bp'}_p)_(q-1))-[Ar¹-(LM{M'-Bp'}_p)_q]_(m-1)    (Vaiv);

(Ar²)_n-C*(-Ar¹(-LM{M'-Bp'}_(p-1){M'-Bp'})(-LM{M'-Bp'}_p)_(q-1))-[Ar¹-(LM{M'-Bp'}_p)_q]_(m-1)  X*    (Vbiv);

and either:

(iia) for modified solid supports of formula (Vaiv), cleaving the C-X bond
between X and the α-carbon atom to form the ion of formula (I); or

(iib) for modified solid supports of formula (Vbiv), dissociating X* from the
derivative of formula (Vbiv) to form the ion of formula (I);

where:

X, X*, Ar¹, Ar², Bp', LM, M, M', p, q, n, m, and Ss are as defined above;
M''---Ss comprises a bond between M'' and Ss; and
M'' is the same as M except that Ss is bound to a portion of M which does not form part of M'.

In this embodiment of the invention, the solid support is bound to a part of group M'' which does not
go on to form the residue M'. Thus, the derivatised biopolymer will be released from the solid
support during the derivatisation step and an additional step of cleaving the biopolymer from the
solid support is not required.

The modified solid supports of formulae (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) or (Vbiv) may
optionally be washed after step (i).

The invention also provides solid supports of the formulae (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii),
(IVbiii) and (IVbiv), as defined above. Similarly, the invention provides modified solid supports of
the formulae (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii), and (Vbiv), as defined above.

The heterogeneous methods of the invention are particularly appropriate for synthetic biopolymers,
e.g. oligonucleotides, peptides and carbohydrates.

Methods of Analysis

The invention also provides a method for analysing a biopolymer, Bp, comprising the steps of:

(i) reacting the biopolymer Bp with a compound of formula (IIa) or (IIb) or a solid
support of formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv);

(ii) providing an ion of formula (I); and

(iii) analysing the ion of formula (I) by mass spectrometry.

The biopolymer will typically have been obtained using a preparative or analytical process. For
example, it may have been purified using various separation methods (e.g. 1-dimensional or
2-dimensional, reverse-phase or normal-phase separation, by e.g. chromatography or electrophoresis)
and the separation may be based on any of a number of characteristics (e.g. isoelectric point,
molecular weight, charge, hydrophobicity, etc.). Typical methods include 2D SDS-PAGE and 2D liquid
chromatography (e.g. Multidimensional Protein Identification Technology, MudPIT, or 2D HPLC
methods). The separation method can preferably interface directly with the mass spectrometer.

Known analytical techniques can thus be adapted or improved by the method of the invention. A
particularly preferred method involves 2D-PAGE of a biopolymer, or mixture of biopolymers,
selection of a spot of interest in the electrophoretogram, and then derivatisation and analysis of that
spot using the techniques of the invention. The biopolymer may be proteolytically digested prior to
its analysis (typically within the PAGE gel, but optionally digested after extraction from the gel)
and/or may itself be the product of a proteolytic digest.

The invention also provides, in a method for analysing a biopolymer, Bp, the improvement consisting
of: (i) reacting a biopolymer, Bp, with a compound of formula (IIa) or (IIb) or a solid support of
formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv); (ii) providing an ion of formula
(I); and (iii) analysing the ion by mass spectrometry.

Typically, the analysis by mass spectrometry is carried out in a spectrometer which is suitable for
MALDI-TOF spectrometry.

In the spectrometer, the ion source may be a matrix-assisted laser desorption ionisation (MALDI) ion
source, an electrospray ionisation (ESI) ion source or a Fast-Atom Bombardment (FAB) ion source.
Preferably, the ion source is a MALDI ion source. The MALDI ion source may be a traditional MALDI
source (under vacuum) or may be an atmospheric pressure MALDI (AP-MALDI) source. MALDI is a
preferred ionisation method, although the use of a matrix is generally not required.

In the spectrometer, the mass analyser may be a time of flight (TOF), quadrupole time of flight
(Q-TOF), ion trap (IT), quadrupole ion trap (Q-IT), triple quadrupole (QQQ) ion trap,
time-of-flight/time-of-flight (TOF-TOF) or Fourier transform ion cyclotron resonance (FTICR) mass
analyser. Preferably, the mass analyser is a TOF mass analyser.

Preferably, the mass spectrometer is a MALDI-TOF mass spectrometer.

Further Embodiments
M' bound to Bp by a non-covalent linker

The above-mentioned embodiments of the invention may also be provided in which M' is bound to
Bp by a non-covalent bond. All the other features of the invention are the same except the groups
which relate to the non-covalent bond between M' and Bp.

The non-covalent bond may be direct between M' and Bp or may be provided by one or more
binding groups present on M' and/or Bp.

Preferred non-covalent bonds are those having an association constant (Ka) of at least 10^14 M^-1,
preferably about 10^15 M^-1.
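
To give a feel for what such a large association constant means, Ka can be converted to a standard binding free energy with the textbook relation ΔG° = -RT ln Ka. The Python sketch below is illustrative only: the function name is hypothetical, a 1 M standard state and a temperature of 298.15 K are assumed, and the example value of 10^15 M^-1 in the usage note is an assumption of the order usually associated with biotin-avidin type binding.

```python
import math

GAS_CONSTANT = 8.314462618  # molar gas constant in J/(mol*K)


def binding_free_energy_kj(ka_per_molar: float,
                           temperature_k: float = 298.15) -> float:
    """Return the standard free energy of binding in kJ/mol.

    Uses dG = -R * T * ln(Ka), with Ka expressed relative to a 1 M
    standard state (an assumption of this sketch).
    """
    if ka_per_molar <= 0:
        raise ValueError("Ka must be positive")
    return -GAS_CONSTANT * temperature_k * math.log(ka_per_molar) / 1000.0
```

Calling `binding_free_energy_kj(1e15)` gives a strongly negative value, which is why such a pair would be expected to survive conditions that break a weaker, deliberately cleavable bond.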

In a preferred embodiment, one of M' and Bp will have a binding group comprising biotin, and the
other of M' and Bp will have a binding group comprising avidin or streptavidin.

Preferably, when the compounds of the invention comprise a non-covalent bond between M' and Bp
and a cleavable bond between C and Ss, Ar¹ and Ss, or Ar² and Ss, these bonds are differentially
cleavable. More preferably, the non-covalent bond between M' and Bp is not cleaved under the
conditions in which the cleavable bond between C and Ss, Ar¹ and Ss, or Ar² and Ss, as appropriate, is
cleaved.

LM bound to Ar¹ by more than one bond

The above-mentioned embodiments of the invention may also be provided in which LM is bound to
Ar¹ by more than one covalent bond (e.g. 2 or 3 bonds) which are either single, double or triple
covalent bonds, or one or more multiple bonds (e.g. double or triple covalent bonds). All the other
features of the invention are the same except the groups which relate to the bond or bonds between
Ar¹ and LM.

Ionisation of Compounds other than Biopolymers

In addition to biopolymers, the present invention may be used for ionising any molecule or complex
of molecules which requires mass spectrum analysis. Thus, the above-mentioned embodiments of the
invention may also be provided in which Bp is replaced by any molecule or complex having at least
one group capable of reacting with M to form a covalent linkage. All the other features of the
invention are the same, except group M is a group capable of reacting with the molecule to be
analysed.

Examples of other molecules which may be analysed in the present invention include non-biological
polymers (e.g. synthetic polyesters, polyamides and polycarbonates), petrochemicals and small
molecules (e.g. alkanes, alkenes, amines, alcohols, esters and amides). Amines are particularly
preferred.

Examples of complexes which may be analysed in the present invention include double- and triple-
stranded RNA, DNA and/or peptide nucleic acid (PNA) complexes, enzyme/substrate complexes,
multimeric proteins (e.g. dimers, trimers, tetramers, pentamers, etc.), virions, etc.

Preferably, when the compound to be ionised is not a biopolymer, all embodiments of the invention
(including products of formulae (I), (IIa), (IIb), (IIIa), (IIIb), (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii),
(IVbiii), (IVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv), methods of forming an ion
of formula (I) and methods of analysis) involving or relating to the compound of formula (XI) are
disclaimed.

[Structure of formula (XI) not reproduced; recoverable labels: OMe, solid support.]


Disclaimers

Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XI) are disclaimed.

[Structure of formula (XI) not reproduced; recoverable labels: OMe, solid support.]

Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XIa) are disclaimed.

[Structure of formula (XIa) not reproduced; recoverable labels: four OMe (MeO) substituents.]



Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XIb) are disclaimed.

[Structure of formula (XIb) not reproduced; recoverable labels: four OMe (MeO) substituents.]



Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XIc) are disclaimed.

[Structure of formula (XIc) not reproduced; recoverable labels: two OMe substituents.]



Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XId) are disclaimed.

[Structure of formula (XId) not reproduced; recoverable labels: OMe (MeO) substituents.]



Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XIe) are disclaimed.

[Structure of formula (XIe) not reproduced.]

Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compound of formula (XIf) are disclaimed.

[Structure of formula (XIf) not reproduced; recoverable labels: four OMe (MeO) substituents.]

Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compounds of formulae (XIg-j) are disclaimed.

[Structure of formulae (XIg-j) not reproduced; recoverable labels: Ar, Base, O, OH.]

Ar = p-anisyl

Formula    Base
XIg        Uridine
XIh        N⁴-benzoyl-cytidine
XIi        N⁶-benzoyl-adenosine
XIj        N²-phenylacetyl-guanosine



Preferably, all embodiments of the invention (including products of formulae (I) and (IIa)) involving
or relating to the compounds of formulae (XIk-n) are disclaimed.

[Structure of formulae (XIk-n) not reproduced; recoverable labels: Ar, Base, OH, O.]

Ar = p-anisyl

Formula    Base
XIk        Uridine
XIl        N⁴-benzoyl-cytidine
XIm        N⁶-benzoyl-adenosine
XIn        N²-phenylacetyl-guanosine

Preferred Embodiments

Definition of C*

Preferably, C* bears a single positive charge such that ions of the invention are cations and the ion
of formula (I) has the following structure:

(Ar²)_n-C⁺-[Ar¹-(LM{M'-Bp'}_p)_q]_m

and the compounds of formulae (IIb), (IIIb), (IVbii), (IVbiii), (IVbiv), (Vbii), (Vbiii) and (Vbiv)
have the structures disclosed in table 1.

n, m, p and q

For the purposes of compounds of the invention having n-1 groups Ar², n may not be less than 1.
Preferably n = 2 and m = 1.
Preferably p = 1, 2 or 3. More preferably p = 1.
Preferably q = 1, 2 or 3. More preferably q = 1.

Preferably n = 2, m = 1, p = 1 and q = 1. The ion of formula (I) thus has the structure:

(Ar²)_2C*-Ar¹-LM-M'-Bp'   or more preferably   (Ar²)_2C⁺-Ar¹-LM-M'-Bp'

and the compounds of formulae (IIa), (IIb), (IIIa), (IIIb), (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii),
(IVbiii), (IVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv) have the structures
disclosed in table 2.

Biopolymers

The term 'biopolymer' includes polymers found in biological samples, including polypeptides,
polysaccharides, and polynucleotides (e.g. DNA or RNA). Polypeptides may be simple copolymers
of amino acids, or they may include post-translational modifications e.g. glycosylation, lipidation,
phosphorylation, etc. Polynucleotides may be single-stranded (in whole or in part), double-stranded
(in whole or in part), DNA/RNA hybrids, etc. RNA may be mRNA, rRNA or tRNA.

Advantageous biopolymers are those which do not readily form a molecular ion in known
MALDI-TOF MS techniques, especially those which do not form a molecular ion on illumination
with laser light at 340 nm.

Biopolymers for use in the invention comprise two or more monomers, which may be the same as or
different from each other. Preferred biopolymers comprise at least pp monomers, where pp is 5 or more
(e.g. 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 250). More
preferred biopolymers comprise ppp or fewer monomers where ppp is 300 or less (e.g. 200, 100, 50).

Biopolymers may have a molecular mass of at least qq kDa, where qq = 0.5 or more (e.g. 0.6, 0.7,
0.8, 0.9, 1, 1.5, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50, 75, 100, etc.). Preferred biopolymers
are those having a molecular mass within the range of detection of a mass spectrometer. More
preferred biopolymers have a molecular mass of qqq kDa or less, where qqq is 30 or less (e.g. 20, 10,
5).
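
The size preferences above amount to simple range checks. The Python sketch below is one way to express them; the function name is hypothetical, and the default bounds take the broadest values quoted in the text (pp = 5, ppp = 300, qq = 0.5 kDa, qqq = 30 kDa), which a user would tighten as required.

```python
def is_preferred_biopolymer(n_monomers: int,
                            mass_kda: float,
                            min_monomers: int = 5,
                            max_monomers: int = 300,
                            min_mass_kda: float = 0.5,
                            max_mass_kda: float = 30.0) -> bool:
    """Check a biopolymer against the preferred monomer and mass ranges.

    Defaults correspond to pp = 5, ppp = 300, qq = 0.5 kDa and
    qqq = 30 kDa; all four are adjustable.
    """
    monomers_ok = min_monomers <= n_monomers <= max_monomers
    mass_ok = min_mass_kda <= mass_kda <= max_mass_kda
    return monomers_ok and mass_ok
```

A 12-residue peptide of about 1.5 kDa passes with the defaults, while a 400-residue protein of 45 kDa does not.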

Preferably, the mass, m(IX), of the fragment (IX)

(Ar²)_n-C*-[Ar¹-(LM{M'}_p)_q]_m    (IX)

of the cation of formula (I) is significantly less than the mass, m(Bp'), of the biopolymer residue Bp'.
For example the ratio m(Bp') / m(IX) is preferably more than nn, where nn is at least 2 (e.g. 3, 4, 5,
10, 100, 1000, etc.).
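
The ratio criterion can be written as a one-line comparison. The Python sketch below uses hypothetical function names and masses; it assumes both masses are given in the same unit and takes nn = 2 as the default threshold, the smallest value mentioned above.

```python
def mass_ratio(residue_mass: float, fragment_mass: float) -> float:
    """Return m(Bp') / m(IX) for masses given in the same unit."""
    if fragment_mass <= 0:
        raise ValueError("fragment mass must be positive")
    return residue_mass / fragment_mass


def satisfies_ratio(residue_mass: float,
                    fragment_mass: float,
                    nn: float = 2.0) -> bool:
    """True when m(Bp') / m(IX) is more than the threshold nn."""
    return mass_ratio(residue_mass, fragment_mass) > nn
```

For example, a 1500 Da residue on a 350 Da fragment gives a ratio of about 4.3, which satisfies nn = 2 but not nn = 10.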

The invention is suitable for use with purified biopolymers or mixtures of biopolymers. For example,
a pure recombinant protein could be derivatised and analysed by MS, or biopolymers within a
cellular lysate or extract could be derivatised and then analysed.

Preferred biopolymers are polypeptides. Particularly preferred biopolymers are polypeptides formed
after proteolytic digestion of a protein.

Biopolymers bound to solid supports 

In preferred embodiments of the invention the biopolymer is bound to a solid support such that it is
cleavable from the solid support at least once it has been derivatised by a compound of the invention.
Bp is thus derivatised in situ while bound to the support, and is then released. As the biopolymer is
bound to the solid support, this aspect of the invention is particularly relevant to methods involving
compounds of formulae (IIa) and (IIb).

The biopolymer may be bound to the solid support by a covalent, ionic, hydrogen, dipole-dipole or
van der Waals bond (also known as a dispersion bond or a London forces bond). The covalent, ionic,
hydrogen, dipole-dipole or van der Waals bond may be direct between the biopolymer and the solid
support or may be provided by one or more binding groups present on the biopolymer and/or solid
support. Preferred groups are non-covalent groups.

Examples of groups which can form these types of bond, and methods for cleaving these types of 
bond, are set out below in connection with C- - -Ss bonds, etc. 

In a particularly preferred embodiment, the solid support is provided with -(NMe3)+ binding groups
and the biopolymer has a net negative charge, or vice versa (i.e. the -(NMe3)+ is on the biopolymer).
In other preferred embodiments, the solid support is provided with anions such as carboxylate,
phosphate or sulphate, or anions formed from acid groups, and the biopolymer (e.g. a histone) has a
net positive charge, or vice versa.

Reactivity with group M 

The biopolymers have at least one reactive group capable of reacting with M to form a covalent
linkage. Such groups typically include naturally occurring groups and groups formed synthetically on
the biopolymer.

Naturally occurring groups include lipid groups of lipoproteins (e.g. myristoyl,
glycosylphosphatidylinositol, ethanolamine phosphoglycerol, palmitate, stearate, S- or N- or O-acyl
groups, lipoic acid, isoprenyl, geranylgeranyl, farnesyl, etc.), amide, carbohydrate groups of N- and
O-glycoproteins, amine groups (e.g. on lysine residues or at the N-terminus of a protein), hydroxyl
(e.g. in β-hydroxyaspartate, β-hydroxyasparagine, 5-hydroxylysine, 3/4-hydroxyproline), thiol,
sulfhydryl, phosphoryl, sulfate, methyl, acetyl, formyl (e.g. on N-terminal methionines from
prokaryotes), phenyl, indolyl, guanidyl, hydroxyl, phosphate, methylthio, ADP-ribosyl etc.

The reactive group is bound to the biopolymer by one or more covalent bonds (e.g. 2 or 3 bonds),
which are either single, double or triple covalent bonds (preferably single bonds). Preferably, the
reactive group is bound to the biopolymer by one single bond.

Groups which may be formed naturally or synthetically on the biopolymer and which are bound to
the biopolymer by one bond include: -NR2 e.g. -NHR, especially -NH2; -SR e.g. -SH; -OR e.g. -OH;
-B(R)Y; -BY2; -C(R)2Y; -C(R)Y2; -CY3; -C(=Z)Y e.g. -C(=O)Y; -Z-C(=Z)Y; -C(=Z)R e.g. -C(=Z)H,
especially -C(=O)H; -C(R)(OH)OR; -C(R)(OR)2; -S(=O)Y; -Z-S(=O)Y; -S(=O)2Y; -Z-S(=O)2Y;
-S(=O)3Y; -Z-S(=O)3Y; -P(=Z)(ZR)Y e.g. -P(=O)(OH)Y; -P(=Z)Y2; -Z-P(=Z)(ZR)Y; -Z-P(=Z)Y2;
-P(=Z)(R)Y e.g. -P(=O)(H)Y; -Z-P(=Z)(R)Y; or -N=C(=Z) e.g. -N=C(=O).

Another group which may be formed naturally or synthetically on the biopolymer and which is
bound to the biopolymer by one bond is -CN.

Other groups which may be formed naturally or synthetically on the biopolymer and which are
bound to the biopolymer by one bond are: -P(ZR)Y e.g. -P(OH)Y; -PY2; -Z-P(ZR)Y; -Z-PY2; -P(R)Y
e.g. -P(H)Y; -Z-P(R)Y. A particularly preferred group is -Z-P(ZR)Y, especially a phosphoramidite
group:

Another example of a group which may be formed naturally or synthetically on the biopolymer and
which is bound to the biopolymer by one bond is -Y. In particular, when the reactive group is halo
(especially iodo), the reactive group may be bound to an aliphatic or aromatic carbon.

Groups which may be formed synthetically on the biopolymer and which are bound to the
biopolymer by two bonds include -N(R)- e.g. -NH-; -S-; -O-; -B(Y)-; -C(R)(Y)-; -CY2-; -C(=O)-;
-C(OH)(OR)-; -C(OR)2-.

Groups which may be formed synthetically on the biopolymer and which are bound to the
biopolymer by three bonds include -C(Y)< (a carbon bearing Y and three bonds to the biopolymer).

Preferred groups include nucleophilic groups, either natural or synthetic, e.g.: -NR2 e.g. -NHR,
especially -NH2; -SR e.g. -SH; -OR e.g. -OH; -N(R)- e.g. -NH-; -S-; and -O-. The groups -NH2, -SH
and -OH are particularly preferred.

Another preferred reactive group is maleimidyl: 



Y is independently a leaving group, including groups capable of leaving in an SN2 substitution
reaction or being eliminated in an addition-elimination reaction with the reactive group of the
biopolymer Bp.

Preferred examples of Y include halogen (preferably iodo), C1-8hydrocarbyloxy (e.g. C1-8alkoxy),
C1-8hydrocarbyloxy substituted with one or more A, C1-8heterohydrocarbyloxy,
C1-8heterohydrocarbyloxy substituted with one or more A, mesyl, tosyl, pentafluorophenyl,
-O-succinimidyl (formula VII) or a sulfo sodium salt thereof (sulfoNHS, formula VIIa),
-S-succinimidyl, or phenyloxy substituted with one or more A e.g. p-nitrophenyloxy (formula VIII)
or pentafluorophenoxy (formula VIIIa).




[Structure drawings not reproduced; recoverable labels: N(iPr)2, O.]




Other preferred examples of Y include -ZR. Particularly preferred examples of Y are -ZH (e.g. -OH
or -NH2) and -Z-C1-8alkyl groups such as -NH-C1-8alkyl groups (e.g. -NHMe) and -O-C1-8alkyl
groups (e.g. -O-t-butyl). Thus, preferred reactive groups are -C(O)-NH-C1-8alkyl and
-C(O)-O-C1-8alkyl (e.g. -C(O)-O-t-butyl).

Other preferred examples of Y include -Z-ZR. Particularly preferred examples include -NR-NR2,
especially -NH-NH2, and -ONR2, especially -O-NH2.

Z is independently O, S or N(R). Preferred (=Z) is (=O).

R is independently H, C1-8hydrocarbyl (e.g. C1-8alkyl) or C1-8hydrocarbyl substituted with one or
more A.

R is preferably H.

Other preferred reactive groups include -C(=O)Y, especially -C(=O)-O-succinimidyl and
-C(=O)-O-(p-nitrophenyl).

In a further embodiment, the reactive group may be -Si(R)2-Y, with Y being halo (e.g. chloro)
especially preferred. Preferred groups R in this embodiment are C1-8alkyl, especially methyl. A
particularly preferred reactive group in this embodiment is -Si(Me)2Cl.

Other groups which may be formed naturally or synthetically on the biopolymer include groups 
capable of reacting in a cycloaddition reaction, especially a Diels-Alder reaction. 

In the case of Diels-Alder reactions, the reactive group on the biopolymer is either a diene or a
dienophile. Preferred diene groups are

[Structures of the preferred diene groups not reproduced.]

and multivalent derivatives formally formed by removal of one or more hydrogen atoms, where A is
-R' or -Z'R', where R' and Z' are defined below.

Preferred dienophile groups are -CR'=CR'2, -CR'=C(R')A', -CA'=CR'2, -CA'=C(R')A' or
-CA'=CA'2, and multivalent derivatives formally formed by removal of one or more hydrogen
atoms, where R' is defined below and A' is independently halogen, trihalomethyl, -NO2, -CN,
-N+(R')2O-, -CO2H, -CO2R', -SO3H, -SOR', -SO2R', -SO3R', -OC(=O)OR', -C(=O)H, -C(=O)R',
-OC(=O)R', -OC(=O)NR'2, -N(R')C(=O)R', -C(=S)NR'2, -NR'C(=S)R', -SO2NR'2, -NR'SO2R',
-N(R')C(=S)NR'2, or -N(R')SO2NR'2, where R' is defined below. A particularly preferred dienophile
group is maleimidyl.

Group M

The group M is capable of reacting with the reactive group of the biopolymer, Bp, to form a covalent
linkage. [Group 'M' is shown as 'AFG' in the drawings.]

The group M is bound to Lm by one or more covalent bonds (e.g. 2 or 3 bonds, especially 2; the
example structure is not reproduced), which are either single, double or triple covalent bonds
(preferably single bonds). Preferably, M is bound to Lm by one single bond.

Alternatively, or in addition, M is bound by more than one Lm, such Lm either being attached to the
same or different Ar¹ or Ar². In a preferred embodiment M is bound by more than one Lm from
different Ar¹ or Ar², e.g.:

[structure not reproduced]



Examples of group M bound to Lm by one bond include -NR2 e.g. -NHR, especially -NH2; -SR e.g.
-SH; -OR e.g. -OH; -B(R)Y; -BY2; -C(R)2Y; -C(R)Y2; -CY3; -C(=Z)Y e.g. -C(=O)Y; -Z-C(=Z)Y;
-C(=Z)R e.g. -C(=Z)H, especially -C(=O)H; -C(R)(OH)OR; -C(R)(OR)2; -S(=O)Y; -Z-S(=O)Y;
-S(=O)2Y; -Z-S(=O)2Y; -S(=O)3Y; -Z-S(=O)3Y; -P(=Z)(ZR)Y e.g. -P(=O)(OH)Y; -P(=Z)Y2;
-Z-P(=Z)(ZR)Y; -Z-P(=Z)Y2; -P(=Z)(R)Y e.g. -P(=O)(H)Y; -Z-P(=Z)(R)Y; or -N=C(=Z) e.g.
-N=C(=O).

Another example of a group M bound to Lm by one bond is -CN.

Other examples of group M bound to Lm by one bond are -P(ZR)Y e.g. -P(OH)Y; -PY2; -Z-P(ZR)Y;
-Z-PY2; -P(R)Y e.g. -P(H)Y; -Z-P(R)Y. A particularly preferred group M is -Z-P(ZR)Y, especially a
phosphoramidite group:

[structure not reproduced]

Another example of group M bound to Lm by one bond is -Y. In particular, when group M is halo
(especially iodo), M may be bound to an aliphatic or aromatic carbon. When M is halo (e.g. iodo)
and is bound to an aromatic carbon, Lm may, for example, be a single bond.



Examples of group M bound to Lm by two bonds include -N(R)- e.g. -NH-; -S-; -O-; -B(Y)-;
-C(R)(Y)-; -CY2-; -C(=O)-; -C(OH)(OR)-; -C(OR)2-.

Preferred groups M include electrophilic groups, especially those susceptible to SN2 substitution
reactions, addition-elimination reactions and addition reactions, e.g. -B(R)Y; -BY2; -C(R)2Y;
-C(R)Y2; -CY3; -C(=Z)Y e.g. -C(=O)Y; -Z-C(=Z)Y; -C(=Z)R e.g. -C(=Z)H, especially -C(=O)H;
-C(R)(OH)OR; -C(R)(OR)2; -S(=O)Y; -Z-S(=O)Y; -S(=O)2Y; -Z-S(=O)2Y; -S(=O)3Y; -Z-S(=O)3Y;
-P(=Z)(ZR)Y e.g. -P(=O)(OH)Y; -P(=Z)Y2; -Z-P(=Z)(ZR)Y; -Z-P(=Z)Y2; -P(=Z)(R)Y e.g.
-P(=O)(H)Y; -Z-P(=Z)(R)Y; -N=C(=Z) e.g. -N=C(=O); -B(Y)-; -C(R)(Y)-; -CY2-; -C(=O)-;
-C(OH)(OR)-; -C(OR)2-; or -C(Y)<.

Another preferred electrophilic group M is -CN. 

Still further preferred examples of group M are orthoesters, e.g. -C(OR)3. In a preferred embodiment,
the R groups are linked together to form a hydrocarbyl group, e.g. a C1-8alkyl group. A preferred
example of group M in this embodiment is:

[structure not reproduced]

Examples of group M bound to Lm by three bonds include:

[structure not reproduced]

Another preferred group M is maleimido.

Y, Z and R are defined as above. Preferred Y groups when present on M are those capable of leaving
in an SN2 substitution reaction or being eliminated in an addition-elimination reaction with the
reactive group of the biopolymer Bp.

Preferred examples of Y include halogen (preferably iodo), C1-8hydrocarbyloxy (e.g. C1-8alkoxy),
C1-8hydrocarbyloxy substituted with one or more A, C1-8heterohydrocarbyloxy,
C1-8heterohydrocarbyloxy substituted with one or more A, mesyl, tosyl, pentafluorophenyl,
-O-succinimidyl (formula VII) or a sulfo sodium salt thereof (sulfoNHS, formula VIIa),
-S-succinimidyl, or phenyloxy substituted with one or more A e.g. p-nitrophenyloxy (formula VIII)
or pentafluorophenoxy (formula VIIIa).

[Structures of formulae (VII), (VIIa), (VIII) and (VIIIa) not reproduced.]

Thus, preferred groups M are:

[structures not reproduced]

Other preferred examples of Y include -ZR. Particularly preferred examples of Y are -ZH (e.g. -OH
or -NH2) and -Z-C1-8alkyl groups such as -NH-C1-8alkyl groups (e.g. -NHMe) and -O-C1-8alkyl
groups (e.g. -O-t-butyl). Thus, preferred groups M are -C(O)-NH-C1-8alkyl (e.g. -C(O)NHMe) and
-C(O)-O-C1-8alkyl (e.g. -C(O)-O-t-butyl).

Other preferred examples of Y include -Z-ZR. Particularly preferred examples include -NR-NR2,
especially -NH-NH2, and -ONR2, especially -O-NH2.

Particularly preferred groups M include -C(=O)Y, especially -C(=O)-O-succinimidyl and
-C(=O)-O-(p-nitrophenyl).

In a further embodiment, M may be -Si(R)2-Y, with Y being halo (e.g. chloro) being especially
preferred. Preferred groups R in this embodiment are C1-8alkyl, especially methyl. A particularly
preferred group M in this embodiment is -Si(Me)2Cl.

In a further embodiment, M may be -C(Ar²)2X. Preferred groups Ar² and X are set out below. In this
embodiment it is preferred that Lm is a bond. A particularly preferred group M in this embodiment is:

[structure not reproduced; it bears two OMe substituents]



Other groups M include groups capable of reacting in a cycloaddition reaction, especially a Diels- 
Alder reaction. 

In the case of Diels-Alder reactions, the reactive group on the biopolymer is either a diene or a
dienophile. Preferred diene groups are

[diene structures not reproduced]

and multivalent derivatives formally formed by removal of one or more hydrogen atoms, where A¹ is
-R¹ or -Z¹R¹, where R¹ and Z¹ are defined below.

Preferred dienophile groups are -CR¹=CR¹₂, -CR¹=C(R¹)A², -CA²=CR¹₂, -CA²=C(R¹)A² or
-CA²=CA²₂, and multivalent derivatives formally formed by removal of one or more hydrogen
atoms, where R¹ is defined below and A² is independently halogen, trihalomethyl, -NO₂, -CN,
-N⁺(R¹)₂O⁻, -CO₂H, -CO₂R¹, -SO₃H, -SOR¹, -SO₂R¹, -SO₃R¹, -OC(=O)OR¹, -C(=O)H, -C(=O)R¹,
-OC(=O)R¹, -OC(=O)NR¹₂, -N(R¹)C(=O)R¹, -C(=S)NR¹₂, -NR¹C(=S)R¹, -SO₂NR¹₂, -NR¹SO₂R¹,
-N(R¹)C(=S)NR¹₂, or -N(R¹)SO₂NR¹₂, where R¹ is defined below. A particularly preferred dienophile
group is maleimidyl.

Preferred examples of group M are shown in figures 11A and 11B.

Matching Bp and M

The reactive group on the biopolymer [shown as 'F' in the drawings] and the group M [shown as
'AFG' in the drawings] must be selected with reference to one another in order to form the covalent
linkage. For example, where the biopolymer includes the groups -NH2, -OH or -SH, M will typically
be -B(R)Y; -BY2; -C(R)2Y; -C(R)Y2; -CY3; -C(=Z)Y e.g. -C(=O)Y; -Z-C(=Z)Y; -C(=Z)R e.g. -C(=Z)H,
especially -C(=O)H; -C(R)(OH)OR; -C(R)(OR)2; -S(=O)Y; -Z-S(=O)Y; -S(=O)2Y; -Z-S(=O)2Y;
-S(=O)3Y; -Z-S(=O)3Y; -P(=Z)(ZR)Y e.g. -P(=O)(OH)Y; -P(=Z)Y2; -Z-P(=Z)(ZR)Y; -Z-P(=Z)Y2;
-P(=Z)(R)Y e.g. -P(=O)(H)Y; -Z-P(=Z)(R)Y; -N=C(=Z) e.g. -N=C(=O); -B(Y)-; -C(R)(Y)-; -CY2-;
-C(=O)-; -C(OH)(OR)-; -C(OR)2-; or -C(Y)<. M may also be -CN.

In a preferred embodiment, one of the reactive group on the biopolymer and group M is a maleimidyl 
and the other will be a -SH group. 

Alternatively, when the covalent linkage is to be formed by a Diels-Alder reaction, one of the
reactive group on the biopolymer and group M will typically be a diene and the other will be a
dienophile.



Preferred covalent linkages are those produced through the reaction of the following groups:

M                                          Group on Bp             Obtained linkage M'-Bp'
-C(=O)-O-succinimidyl [i.e. carboxy-NHS]   -NH2                    -CO-NH-
-C(=O)-O-(p-nitrophenyl)                   -NH2                    -CO-NH-
-C(=O)-pentafluorophenyl                   -NH2                    -CO-NH-
Biotin                                     avidin / streptavidin   biotin-(strept)avidin
[maleimidyl structure, not reproduced]     -SH                     [thioether adduct structure, not reproduced]
-N=C=S (isothiocyanate)                    -NH2                    -NH-CS-NH-
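
The pairings tabulated above can be read as a simple lookup from (group M, group on Bp) to the linkage obtained. The following is a minimal illustrative sketch only: the dictionary and function names are hypothetical, and the maleimidyl / -SH row is omitted because its structures are not legible in this text.

```python
# Pairings transcribed from the table of preferred covalent linkages.
# Keys are (group M, reactive group on Bp); values are the linkage M'-Bp'.
# All names here are illustrative assumptions, not part of the specification.
LINKAGES = {
    ("-C(=O)-O-succinimidyl", "-NH2"): "-CO-NH-",
    ("-C(=O)-O-(p-nitrophenyl)", "-NH2"): "-CO-NH-",
    ("-C(=O)-pentafluorophenyl", "-NH2"): "-CO-NH-",
    ("biotin", "avidin / streptavidin"): "biotin-(strept)avidin",
    ("-N=C=S", "-NH2"): "-NH-CS-NH-",
}


def linkage_for(m, bp_group):
    """Return the tabulated linkage for a pairing, or None if it is not listed."""
    return LINKAGES.get((m, bp_group))


def partners_for(bp_group):
    """List the tabulated groups M that pair with a given group on Bp."""
    return sorted(m for (m, g) in LINKAGES if g == bp_group)
```

A pairing that is absent from the table simply returns `None`; that says nothing about whether the two groups could react, only that the pair is not among the listed preferred linkages.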



The covalent residue M'-Bp' is the reaction product of M and Bp. Bp' will generally be the same as Bp
except that instead of the reactive group, Bp' will have a residue of the reactive group covalently
bound to the residue M'. Depending on the choice of the reactive group and the choice of M, M' and
the residue of the reactive group will typically form linkages, in the orientation Lm-M'-Bp', including
-C(R)2Z-, -ZC(R)2-, -C(=Z)Z-, -ZC(=Z)-, -ZC(=Z)Z-, -C(OH)(R)Z-, -ZC(OH)(R)-, -C(R)(OR)Z-,
-ZC(R)(OR)-, -S(=O)Z-, -ZS(=O)-, -ZS(=O)Z-, -S(=O)2Z-, -ZS(=O)2-, -ZS(=O)2Z-, -S(=O)3Z-,
-ZS(=O)3-, -ZS(=O)3Z-, -P(=Z)(ZR)Z-, -ZP(=Z)(ZR)-, -ZP(=Z)(ZR)Z-, -P(=Z)(R)Z-, -ZP(=Z)(R)-,
-ZP(=Z)(R)Z-, -NH-C(=Z)-Z-, where Z and R are as defined above.

Group M"



M" is the same as M except that the group Ss is bound to a portion of M which does not form part of
M'. Thus, M" is a residue of M formable by the conjugation of M and Ss. However, M" need not
necessarily be formed by the conjugation of M and Ss.

M"---Ss comprises a covalent, ionic, dipole-dipole, hydrogen, or van der Waals bond. The covalent,
ionic, hydrogen, dipole-dipole or van der Waals bond may be direct between M" and Ss or may be
provided by one or more binding groups present on M" and/or Ss.

Examples of groups which can form these types of bond, and methods for cleaving these types of
bond, are set out below in connection with C---Ss bonds, etc.

This embodiment of the invention is advantageous, since the derivatisation of the biopolymer will
also release the derivatised biopolymer from the solid support. Thus, an additional step of cleaving
the biopolymer from the solid support is not required.

Preferred groups M" are groups M having a leaving group, wherein the group Ss is bound to the
leaving group, e.g. groups M mentioned above having a leaving group Y, wherein the group Ss is
bound to the leaving group Y.

A particularly preferred group M" is:

[structure not reproduced]

Where the group Lm is a linker atom or group, it has a sufficient number of linking covalent bonds to
link Lm to the group Ar¹ by a single covalent bond (or more, as appropriate) and to link Lm to the p
instances of M (or M', as appropriate) groups (which may be attached to Lm by one or more bonds).

The group Lm may be directly bound to the aromatic part of Ar¹, bound to one or more of the
substituents A of Ar¹, or both. Preferably, Lm is bound directly to the aromatic part of Ar¹.

In an alternative embodiment, Lm may be bound to Ls. 

When Lm is a linker atom, preferred linker atoms are O or S, particularly O.

When Lm is a linker group, preferred linker groups, in the orientation Ar¹-(LM{M}p)q or
Ar¹-(LM{M'}p)q, as appropriate, are -Eᴹ-, -(Dᴹ)t-, -(Eᴹ-Dᴹ)t-, -(Dᴹ-Eᴹ)t-, -Eᴹ-(Dᴹ-Eᴹ)t- or
-Dᴹ-(Eᴹ-Dᴹ)t-, where a sufficient number of linking covalent bonds, in addition to the covalent
bonds at the chain termini shown, are provided on groups Eᴹ and Dᴹ for linking the p instances of M
(or M') groups.

Dᴹ is independently C1-8hydrocarbylene or C1-8hydrocarbylene substituted with one or more A.
Preferred are C1-8alkylene, C1-8alkenylene and C1-8alkynylene, especially C1-8alkylene and
C1-8alkynylene, each optionally substituted with one or more A (preferably unsubstituted). A
preferred substituent A is ²H. Preferred Lm in the orientation Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q, as
appropriate, are: -CH2CH2-; -C≡C-CH2CH2CH2-; -(CH2)5-; -CD2CD2CH2CH2CH2-; -C≡C-CH2- and
-CH2CH2CH2-.

Eᴹ, in the orientation Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q, as appropriate, is independently -Zᴹ-,
-C(=Zᴹ)-, -ZᴹC(=Zᴹ)-, -C(=Zᴹ)Zᴹ-, -ZᴹC(=Zᴹ)Zᴹ-, -S(=O)-, -ZᴹS(=O)-, -S(=O)Zᴹ-,
-ZᴹS(=O)Zᴹ-, -S(=O)2-, -ZᴹS(=O)2-, -S(=O)2Zᴹ-, -ZᴹS(=O)2Zᴹ-, where Zᴹ is independently O, S or
N(Rᴹ) and where Rᴹ is independently H, C1-8hydrocarbyl (e.g. C1-8alkyl) or C1-8hydrocarbyl
substituted with one or more A. Preferably Eᴹ is, in the orientation Ar¹-(LM{M}p)q or
Ar¹-(LM{M'}p)q, as appropriate, -S-, -C(=O)-, -C(=O)O-, -C(=S)-, -C(=S)O-, -OC(=S)-,
-C(=O)S-, -SC(=O)-, -S(O)-, -S(O)2-, -NRᴹ-, -C(=O)N(Rᴹ)-, -C(=S)N(Rᴹ)-, -N(Rᴹ)C(=O)-,
-N(Rᴹ)C(=S)-, -S(=O)N(Rᴹ)-, -N(Rᴹ)S(=O)-, -S(=O)2N(Rᴹ)-, -N(Rᴹ)S(=O)2-, -OC(=O)O-,
-SC(=O)O-, -OC(=O)S-, -N(Rᴹ)C(=O)O-, -OC(=O)N(Rᴹ)-, -N(Rᴹ)C(=O)N(Rᴹ)-,
-N(Rᴹ)C(=S)N(Rᴹ)-, -N(Rᴹ)S(=O)N(Rᴹ)- or -N(Rᴹ)S(=O)2N(Rᴹ)-.

Alternative groups Eᴹ to those defined above, in the orientation Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q,
as appropriate, are -Zᴹ-Si(Rᴹ)2-Zᴹ-, -Si(Rᴹ)2-Zᴹ- and -Zᴹ-Si(Rᴹ)2-. The group -Si(Rᴹ)2-Zᴹ- is
particularly preferred. Zᴹ is preferably O. Rᴹ is preferably C1-8alkyl, preferably methyl. These
groups Eᴹ are particularly preferred in the groups -(Eᴹ-Dᴹ)t-, especially when t=1 and Dᴹ is
C1-8alkylene. The following group is especially preferred:

[structure not reproduced]



In addition to the above definition of Dᴹ, Dᴹ may also be C1-8heterohydrocarbylene or
C1-8heterohydrocarbylene substituted with one or more A. In this embodiment, preferred
groups -Dᴹ-Eᴹ-Dᴹ- are, in the orientation Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q, as appropriate,
-C1-8alkylene-C(O)-C1-8cycloheteroalkylene (preferably where the hetero atom is N and is bound to
the carbonyl), especially:



Me 




25 CMcycloheteroalkylene groups are particularly preferred, e,gr. 




. Thus, preferred Lm 




t = 1 or more, e.g. from 1 to 50, 1 to 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t = 1, 2, 3, 4, 5, 6, 7, 8,
9, or 10.

Preferably, Lm links one group M (or M') to Ar¹. M (or M') is linked to Lm by a single covalent bond
and therefore no additional bonds are required (e.g. Lm{M}1 may be -Eᴹ-{M}, -(Dᴹ)t-{M},
-(Eᴹ-Dᴹ)t-{M}, -(Dᴹ-Eᴹ)t-{M}, -Eᴹ-(Dᴹ-Eᴹ)t-{M} or -Dᴹ-(Eᴹ-Dᴹ)t-{M}).

Where Lm includes a group which also falls within the definition of group M, the group M is
preferably more reactive than the group included in Lm.

Lm is preferably -(Dᴹ)t-, -(Eᴹ-Dᴹ)t-, or -Dᴹ-(Eᴹ-Dᴹ)t-.

When group Lm is -(Dᴹ)t-, t is preferably 1. Dᴹ is preferably C1-8alkylene, preferably methylene or
ethylene.

When group Lm is -(Eᴹ-Dᴹ)t-, or -Dᴹ-(Eᴹ-Dᴹ)t-, Eᴹ is preferably (in the orientation Ar¹-(LM{M}p)q
or Ar¹-(LM{M'}p)q, as appropriate), -C(=O)N(Rᴹ)- (e.g. -C(=O)NH-) or O (preferably O), and Dᴹ is
preferably C1-8alkylene, preferably ethylene, propylene, butylene or pentylene (preferably ethylene or
propylene). t is preferably 1. Especially preferred Lm are, in the orientation Ar¹-(LM{M}p)q or
Ar¹-(LM{M'}p)q, as appropriate, -O-CH2CH2CH2- and -O-CH2CH2CH2CH2CH2-.

Another preferred group -Dᴹ-(Eᴹ-Dᴹ)t- is where Dᴹ is C1-8alkylene and t is 1. Preferred Eᴹ in this
group, in the orientation Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q, as appropriate, are -ZᴹC(=Zᴹ)-
(especially -N(Rᴹ)C(=O)-, e.g. -N(Me)C(=O)-) and -C(=Zᴹ)Zᴹ- (especially -C(=O)O-). Particularly
preferred Lm groups are:

[structures not reproduced]



The group -(Eᴹ-Dᴹ)t- is preferred, a particularly preferred example of which is (in the orientation
Ar¹-(LM{M}p)q or Ar¹-(LM{M'}p)q, as appropriate)

-C(=O)NH-CH2CH2CH2-O-CH2CH2-O-CH2CH2-O-CH2CH2CH2-.
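
One way to read the linker just given is as four consecutive (E, D) repeat units, i.e. a group of the -(E-D)t- type with t = 4, where E and D are chosen independently in each unit. The sketch below is only an illustration of that reading; the unit list and the function name are assumptions introduced here, and other decompositions of the same chain are possible.

```python
# Each tuple is one (E, D) repeat unit, written from the Ar end to the M end.
# E and D are selected independently for each unit (assumed decomposition).
UNITS = [
    ("C(=O)NH", "CH2CH2CH2"),
    ("O", "CH2CH2"),
    ("O", "CH2CH2"),
    ("O", "CH2CH2CH2"),
]


def build_linker(units):
    """Join (E, D) units into a -(E-D)t- string with open bonds at both ends."""
    return "-" + "-".join(f"{e}-{d}" for e, d in units) + "-"


linker = build_linker(UNITS)
t = len(UNITS)  # number of repeat units in this reading
```

Under this decomposition `linker` reproduces the chain written above and `t` is 4, which lies within the stated range of 1 to 10.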



In an alternative embodiment it is preferred that Lm is a single covalent bond.

When Ar¹ is phenyl, Lm is preferably provided in a position ortho or para to C*. When Ar¹ is other
than phenyl, Lm is preferably attached to an atom which bears the charge in at least one of the
resonance structures of the ions of formula (I).

Where C* is a cation, Lm is preferably an electron-donating group. Where C* is an anion, Lm is
preferably an electron-withdrawing group.

Preferred examples of Lm are shown in figures 10A and 10B.

C---Ss, Ss---Ar¹ and Ss---Ar² Bonds

C---Ss, Ss---Ar¹ and Ss---Ar² comprise a cleavable covalent, ionic, hydrogen, dipole-dipole or van
der Waals bond (also known as a dispersion bond or a London forces bond). The covalent, ionic,
hydrogen, dipole-dipole or van der Waals bond may be direct between C and Ss, Ar¹ and Ss, or Ar²
and Ss, or may be provided by one or more binding groups present on C and/or Ss, Ar¹ and/or Ss, or
Ar² and/or Ss, respectively.

Covalent Bonding 

Where the bond is covalent, the bond may be direct (e.g. C-Ss, Ar^-Ss or Ar^-Ss, respectively) or may 
15 be provided by a linker atom or group L^ (e.g. C-L'^-Ss, Ar^-L'^-Ss or Ar^-L'^-Ss, respectively). 

When L* is a linker group, preferred linker groups are -E"*-, -(D^)t"-, -(E^-D"*)!"-, -(D'^-E'^)t"-, 
-E'^-(D^.E^)t- or -D^-(E^-D*)t"-. 

D*^ is independently Ci-ghydrocarbylene or Ci^ydrocarbylene substituted with one or more A. 

E"* is, in the orientation C-L^-Ss. independently -Z\ -C(=Z^K -Z^C(=ZV, -C(=Z^)Z^-, -Z*C(=Z^)Z*., 
20 -S(=0)-, -Z^S(=0)-, -S(=0)Z\ -Z*S(=0)Z\ -S(=0>2-, -Z^S(=0)2-, -S(=0)2Z*-, -Z*S(-0)2Z*-, where 

Z^ is independently O, S or N(R^), and where R"* is independently H, Ci^hydrocarbyl (e.g. Ci^alkyl) 

or Ci^hydrocarbyl substituted with one or more A. Preferably E'* is, in the orientation C-L''-Ss, -0-, 

-S-, -C(=0)-, -C(=0)0-, -C(=S)-, -C(=S)0-, -OC(=S)-, -C(=0)S-, -SC(=0)-. -S(0)-, -S(0)2-. 

-NCR")-, -C(=0)N(R'*)-, -C(=S)N(R*)-, -N(R'')C(=0)-, -N(R'')C(=S)-. -S(=0)N(R'*)-, -N(R'*)S(=0)-, 
25 -S(=0)2N(R'*)-, -N(R'*)S(=0)2-, -0C(=0)0-, -SC(=0)0-, -OC(=0)S-, -N(R'')C(=0)0-, 

-OC(=0)N(R*)-, -N(R'*)C(=0)N(R'')-, -N(R'^)C(=S)N(R'')-, -N(R^)S(=0)N(R'*)- or - 

N(R'*)S(=0)2N(R'*)-. 

t" = 1 or more, e.g. from 1 to 50, 1 to 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t" = 1, 2, 3, 4, 5, 6, 7,
8, 9, or 10.

30 Where lJ includes a group which also falls within tiie definition of group M, the group M is 
preferably more reactive than the group included in jJ. 

is preferably a linker atom, preferably O or S, particularly O. 

When the solid support Ss is gold, L'* is preferably covalently attached to the Ss by a sulphide or 
disulphide group. 

Ionic Bonding

Where the bond is ionic, the bond is typically direct (e.g. C* Ss*, where Ss* is a solid support
counterion to C*).

Alternatively, it may be provided by binding groups, e.g. chelating ligands, present on C or Ss, Ar¹ or
Ss, or Ar² or Ss, respectively. In the case of C---Ss bonds, the chelating ligand is typically only
present on Ss and chelates with C*.

Suitable chelating ligands which can bind anions include polyamines and cryptands. 

Suitable chelating ligands which can bind cations include polyacidic compounds (e.g. EDTA) and 
crown ethers. 

Hydrogen Bonding

Where the bond is a hydrogen bond, the bond is usually provided by binding groups present on C or
Ss, Ar¹ or Ss, or Ar² or Ss, respectively.

Typically, in order to form the hydrogen bond, one of C or Ss, Ar¹ or Ss, or Ar² or Ss, as appropriate,
will have a binding group bearing one or more hydroxy, amino or thio hydrogen atoms, and the other
of C or Ss, Ar¹ or Ss, or Ar² or Ss, respectively, will have a binding group bearing an atom having
one or more lone pairs of electrons (e.g. an oxygen, sulphur or nitrogen atom). Preferably, one of C or
Ss, Ar¹ or Ss, or Ar² or Ss, as appropriate, will have a binding group comprising biotin, and the other
of C or Ss, Ar¹ or Ss, or Ar² or Ss, respectively, will have a binding group comprising avidin or
streptavidin.

Alternatively, the hydrogen bond may be direct.

Dipole-Dipole Bonding 

Where the bond is a dipole-dipole bond, it may be formed between permanent dipoles or between a 
permanent dipole and an induced dipole. 

Typically, in order to form the dipole-dipole bond, one of Ss and the compound of the invention has
a permanent dipole and the other of Ss and the compound of the invention has an induced dipole or a
permanent dipole, the attraction between the dipoles forming a dipole-dipole bond.

Preferably, Ss comprises binding groups (e.g. acid groups, -(NMe3)+, carboxy, carboxylate,
phosphate or sulphate groups) which produce a dipole at the surface of the solid support to bind the
compound of the invention.

Van der Waals Bonding

Where the bond is a van der Waals bond, the bonding is usually provided by binding groups present
on C or Ss, Ar¹ or Ss, or Ar² or Ss, respectively.

Typically, in order to form the van der Waals bond, at least one, but preferably both, of C or Ss, Ar¹
or Ss, or Ar² or Ss, as appropriate, will have a hydrocarbyl or heterohydrocarbyl group (usually a
large hydrocarbyl group having at least ten carbon atoms up to about 50 carbon atoms), optionally
substituted with one or more A. Polyfluorinated hydrocarbyl and heterohydrocarbyl groups are
particularly preferred. Typically, the hydrocarbyl or heterohydrocarbyl groups are aryl or heteroaryl
groups or groups of the formula -C(R^2Ar^5 -C(R^)(Ar^)2 or -C(Ar^)3, where Ar^ is independently 
defined the same as Ar^ and is H, Ci-g hydrocarbyl, Ci-g hydrocarbyl substituted by one or more 
A, Ci-s heterohydrocarbyl or Ci.s heterohydrocarbyl substituted by one or more A. 

A preferred binding group is tetrabenzofullerene (formula X).

[structure of formula (X) not reproduced]

Alternatively, the van der Waals bond may be direct.

Bond Cleavage

Preferably, the ions of formula (I) have a pKR+ value of at least zz, where zz is 0 or more (e.g. 0, 1, 2,
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14). More preferably, zz is 1 or more, still more preferably 2 or
more, still more preferably 3 or more.

Preferably, the compounds of formula (IIa), (IIb), (IIIa) or (IIIb) or the solid supports of formula
(IVai), (IVaii), (IVaiii), (IVbii), (IVbiii), (IVaiv) or (IVbiv) provide ions of formula (I) having a pKR+
value of at least zz, where zz is defined above.
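
To give a feel for what such a threshold implies, the sketch below reads the value as the standard carbocation stability constant pKR+ (an assumption about the garbled symbol) for the equilibrium R+ + H2O = ROH + H+. It further assumes ideal dilute aqueous behaviour, in which the fraction of material present as the cation follows a Henderson-Hasselbalch-type expression; in strongly acidic media an acidity function would be needed instead. The function names are illustrative only.

```python
def cation_fraction(pk_r_plus, ph):
    """Fraction present as R+ at a given pH, assuming ideal dilute solution.

    Derived from K_R+ = [ROH][H+]/[R+], so [R+]/[ROH] = 10**(pK_R+ - pH).
    """
    return 1.0 / (1.0 + 10.0 ** (ph - pk_r_plus))


def meets_threshold(pk_r_plus, zz):
    """True if the ion satisfies 'pK_R+ of at least zz'."""
    return pk_r_plus >= zz
```

Under these assumptions an ion with pKR+ = 3 is half ionised at pH 3 and roughly 99% ionised at pH 1, which illustrates why a higher pKR+ corresponds to a more readily formed, more stable cation.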



C-X Bonds

The C-X bonds are cleavable by irradiation, electron bombardment, electrospray, fast atom
bombardment (FAB), inductively coupled plasma (ICP) or chemical ionisation. Preferably, the C-X
bonds are cleavable by irradiation or chemical ionisation.

The term 'irradiation' includes, for example, laser illumination, in particular as used in MALDI mass
spectrometry. Laser light of about 340 nm is particularly preferred because it is typically used in
MALDI mass spectrometers.
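
For scale, the energy carried by a single photon at the wavelength mentioned above can be estimated from E = hc/λ. The snippet below is a back-of-envelope sketch using the exact SI values of the Planck constant and the speed of light; the function name is illustrative and nothing here is taken from the specification itself.

```python
PLANCK_EV_S = 4.135667696e-15   # Planck constant in eV*s
SPEED_OF_LIGHT = 299_792_458.0  # speed of light in m/s


def photon_energy_ev(wavelength_nm):
    """Energy of one photon, in electronvolts, for a wavelength in nanometres."""
    return PLANCK_EV_S * SPEED_OF_LIGHT / (wavelength_nm * 1e-9)


energy_340 = photon_energy_ev(340.0)  # roughly 3.65 eV
```

A 340 nm photon therefore carries about 3.65 eV, far less than the roughly 70 eV per electron typical of electron bombardment.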

The term 'electron bombardment' includes, for example, bombardment with electrons having energy
of about 70 eV.

Chemical ionisation can be effected, for example, by treatment with acid or acidic matrices (e.g.
acidic matrices used in MALDI analysis).

Preferably group X is halogen, hydroxy, C1-8hydrocarbyloxy, C1-8hydrocarbyloxy substituted with
one or more A, C1-8heterohydrocarbyloxy, C1-8heterohydrocarbyloxy substituted with one or more A,
mesyl, tosyl, pentafluorophenyl, -O-succinimidyl, -S-succinimidyl, or phenyloxy substituted with one
or more A e.g. p-nitrophenyloxy. The groups pentafluorophenyl, -O-succinimidyl, -S-succinimidyl,
and p-nitrophenyloxy are particularly preferred.

Particularly preferred groups X are halogen, hydroxy and C1-8hydrocarbyloxy. Especially preferred
groups are hydroxy, ethoxy and chloro groups.

Other preferred groups X are alkyl ethers, e.g.:

[structures not reproduced]

Group X may also be a -Q-oligonucleotide, where Q is O, S or N(R), where R is H, C1-8hydrocarbyl
or C1-8hydrocarbyl substituted with one or more A. Q is preferably O.

Group X may also be a nucleoside, preferably where the nucleoside is bound via its 5' end, e.g.:

[structure not reproduced]



In some embodiments of the invention, where Bp is an antibody (particularly where it is a
monoclonal antibody that recognises a tumour-associated antigen), X is not:

[structure not reproduced]

or, optionally, X is not any other 2,6-diaminopurine nucleoside prodrug group.

In some embodiments of the invention, X is not H. If X is H, preferably at least one of Ar¹ and Ar² is
polycyclic, heterocyclic or unsubstituted.

Preferred examples of group X are shown in figure 13.

Ionic C* X* Bonds

X* is any counterion for forming salts with compounds of the invention.

X* includes ions having single charges and multiple charges. Typically ions having multiple
charges will be associated with an appropriate number of compounds of formula (IIb), (IIIb), (IVbii),
(IVbiii), (IVbiv), (Vbii), (Vbiii) or (Vbiv) in order to balance the charge. Ions having multiple charges
include doubly charged ions (e.g. SO₄²⁻) and triply charged ions. X* preferably has a single charge.
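
The charge-balancing statement above amounts to simple stoichiometry: the smallest whole numbers of compound ions and counterions whose charges sum to zero. A minimal sketch follows; the function name and the integer-charge representation are assumptions made for illustration.

```python
from math import gcd


def balancing_counts(compound_charge, counterion_charge):
    """Smallest (n_compound, n_counterion) giving zero net charge.

    Charges are signed integers and must have opposite signs.
    """
    if compound_charge * counterion_charge >= 0:
        raise ValueError("charges must be non-zero and of opposite sign")
    a, b = abs(compound_charge), abs(counterion_charge)
    g = gcd(a, b)
    return b // g, a // g
```

For example, a singly charged cation paired with a doubly charged anion such as sulphate gives `(2, 1)`: two compound ions per counterion.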

The counterion X* may be dissociated from the derivative of formula (IIb), (IIIb), (IVbii), (IVbiii),
(IVbiv), (Vbii), (Vbiii) or (Vbiv) by irradiation, electron bombardment, electrospray, fast atom
bombardment (FAB), inductively coupled plasma (ICP) or chemical ionisation. Preferably, the
counterion X* may be dissociated by irradiation.

When X* is a cation, X* is preferably H+.

When X* is an anion, X* is preferably BF4- or ClO4-.

It is preferred that X* is an anion.

Preferred examples of group X* are shown in figure 13.

C---Ss, Ss---Ar¹ or Ss---Ar²

The C---Ss, Ss---Ar¹ or Ss---Ar² bonds are cleavable by irradiation, electron bombardment,
electrospray, fast atom bombardment (FAB), inductively coupled plasma (ICP) or chemical
ionisation. Preferably, the C---Ss, Ss---Ar¹ or Ss---Ar² bonds are cleavable by irradiation or chemical
ionisation.

Where appropriate, the C---Ss, Ss---Ar¹ or Ss---Ar² bonds may be cleaved simultaneously or
sequentially with the cleaving of the C-X bond or the dissociation of X*, as appropriate, by
selection of suitable cleaving/dissociating conditions.

In one embodiment of the invention, the C---Ss bond in the solid support of formula (Vai) may be
cleaved in sub-steps of step (iia) so that in a first sub-step a residue X (where X is the leaving group
defined above) is provided and in a second subsequent sub-step the C-X bond is cleaved thereby
forming the ion of formula (I). If desired, the second sub-step may be carried out substantially (e.g.
seconds, minutes, hours or even days) after the first sub-step.

Ar¹ and Ar²

Ar²

Ar² is independently an aromatic group or an aromatic group substituted with one or more A and is
preferably independently cyclopropyl, cyclopropyl substituted with one or more A, aryl, aryl
substituted with one or more A, heteroaryl, or heteroaryl substituted with one or more A.

Where aryl or substituted aryl, Ar² is preferably C6-30 aryl or substituted C6-30 aryl. Where heteroaryl
or substituted heteroaryl, Ar² is preferably C6-30 heteroaryl or substituted C6-30 heteroaryl.

Examples of aryl and heteroaryl are monocyclic aromatic groups (e.g. phenyl or pyridyl), fused
polycyclic aromatic groups (e.g. naphthyl, such as 1-naphthyl or 2-naphthyl) and unfused polycyclic
aromatic groups (e.g. monocyclic or fused polycyclic aromatic groups linked by a single bond, a
double bond, or by a -(CH=CH)r- linking group, where r is one or more (e.g. 1, 2, 3, 4 or 5)).

Other examples of aryl groups are monovalent derivatives of aceanthrylene, acenaphthylene,
acephenanthrylene, anthracene, azulene, chrysene, coronene, fluoranthene, fluorene, as-indacene,
s-indacene, indene, naphthalene, ovalene, perylene, phenalene, phenanthrene, picene, pleiadene,
pyrene, pyranthrene and rubicene, which groups may be optionally substituted by one or more A.
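
As a quick cross-check of the preferred C6-30 range against some of the parent hydrocarbons just listed, the sketch below tabulates carbon counts from their standard molecular formulae and flags any that fall outside the range. Only a subset of the listed compounds is included, the counts are for the unsubstituted parents (substituents A would add to them), and the names in the code are illustrative assumptions.

```python
# Carbon atoms in the unsubstituted parent hydrocarbon (standard formulae).
CARBON_COUNT = {
    "indene": 9, "azulene": 10, "naphthalene": 10, "acenaphthylene": 12,
    "fluorene": 13, "anthracene": 14, "phenanthrene": 14,
    "fluoranthene": 16, "pyrene": 16, "chrysene": 18, "perylene": 20,
    "picene": 22, "coronene": 24, "ovalene": 32,
}


def in_preferred_range(name, low=6, high=30):
    """True if the parent hydrocarbon has between low and high carbons inclusive."""
    return low <= CARBON_COUNT[name] <= high


outside = sorted(n for n in CARBON_COUNT if not in_preferred_range(n))
```

Of the entries tabulated here, only ovalene (32 carbons) lies outside the preferred range; since the range is stated as a preference rather than a limit, this does not exclude it from the general definition.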

Other examples of heteroaryl groups are monovalent derivatives of acridine, carbazole, β-carboline,
chromene, cinnoline, furan, imidazole, indazole, indole, indolizine, isobenzofuran, isochromene,
isoindole, isoquinoline, isothiazole, isoxazole, naphthyridine, perimidine, phenanthridine,
phenanthroline, phenazine, phthalazine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine,
pyrimidine, pyrrole, pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, thiophene and
xanthene, which groups may be optionally substituted by one or more A. Preferred heteroaryl groups
are five- and six-membered monovalent derivatives, such as the monovalent derivatives of furan,
imidazole, isothiazole, isoxazole, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine,
pyrrole, pyrrolizine and thiophene. The five-membered monovalent derivatives are particularly
preferred, i.e. the monovalent derivatives of furan, imidazole, isothiazole, isoxazole, pyrazole,
pyrrole and thiophene. The heteroaryl groups may be attached to the remainder of the compound by
any carbon or hetero (e.g. nitrogen) atom.

Ar² is preferably C6-30aryl substituted by one or more A, preferably phenyl or naphthyl (e.g. 1-naphthyl
or 2-naphthyl, especially 2-naphthyl) substituted by one or more A, more preferably phenyl substituted
by one or more A. When Ar² is phenyl, A is preferably provided in a position ortho or para to C*.
When Ar² is other than phenyl, A is preferably attached to an atom which bears the charge in at least
one of the resonance structures of the ions of formula (I).

Fused polycyclic aromatic groups, optionally substituted with one or more A, are particularly
preferred.

A particularly preferred Ar² is unsubstituted pyrenyl or pyrenyl substituted with one or more A.
Unsubstituted pyrenyl is preferred. The pyrenyl group may be 1-pyrenyl, 2-pyrenyl or 4-pyrenyl.

Preferred heteroaryl Ar² groups, whether substituted or unsubstituted, are pyridyl, pyrrolyl, thienyl
and furyl, especially thienyl.

A preferred Ar² group is thiophenyl or thiophenyl substituted with one or more A. Unsubstituted
thiophenyl is preferred. Examples of thiophenyl are thiophen-2-yl and thiophen-3-yl, with
thiophen-2-yl being especially preferred.

When substituted, Ar² is preferably substituted by 1, 2 or 3 A. Ar² is preferably:

[structure not reproduced; it bears an OMe substituent]

When unsubstituted, Ar² is preferably:

[structure not reproduced]

In another preferred embodiment, Ar² is cyclopropyl or cyclopropyl substituted with one or more A.
Unsubstituted cyclopropyl is preferred. One or more, preferably one, of Ar² may be cyclopropyl.

Preferred examples of group Ar² are shown in figures 12A and 12B.

Ar¹

Ar¹ is independently an aromatic group or an aromatic group substituted with one or more A. The
definition of Ar¹ is the same as Ar² (as defined above), except that the valency of the group Ar¹ is
adapted to accommodate the q instances of the linker Lm. Preferred Ar¹ groups are also preferred Ar²
groups (as defined above), except that the valency of the group Ar¹ is adapted to accommodate the q
instances of the linker Lm.

When q = 1, Ar¹ is a divalent radical and is preferably independently cyclopropylene, cyclopropylene
substituted with one or more A, arylene, arylene substituted with one or more A, heteroarylene, or
heteroarylene substituted with one or more A.

Where arylene or substituted arylene, Ar¹ is preferably C6-30 arylene or substituted C6-30 arylene.
Where heteroarylene or substituted heteroarylene, Ar¹ is preferably C6-30 heteroarylene or substituted
C6-30 heteroarylene.

Examples of arylene and heteroarylene are monocyclic aromatic groups (e.g. phenylene or
pyridylene), fused polycyclic aromatic groups (e.g. naphthylene) and unfused polycyclic aromatic
groups (e.g. monocyclic or fused polycyclic aromatic groups linked by a single bond, a double bond,
or by a -(CH=CH)r- linking group, where r is one or more (e.g. 1, 2, 3, 4 or 5)).

Other examples of arylene groups are polyvalent derivatives (where the valency is adapted to
accommodate the q instances of the linker Lm) of aceanthrylene, acenaphthylene, acephenanthrylene,
anthracene, azulene, chrysene, coronene, fluoranthene, fluorene, as-indacene, s-indacene, indene,
naphthalene, ovalene, perylene, phenalene, phenanthrene, picene, pleiadene, pyrene, pyranthrene and
rubicene, which groups may be optionally substituted by one or more A.

Other examples of heteroarylene groups are polyvalent derivatives (where the valency is adapted to
accommodate the q instances of the linker LM) of acridine, carbazole, β-carboline, chromene,
cinnoline, furan, imidazole, indazole, indole, indolizine, isobenzofuran, isochromene, isoindole,
isoquinoline, isothiazole, isoxazole, naphthyridine, perimidine, phenanthridine, phenanthroline,
phenazine, phthalazine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, pyrrole,
pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, thiophene and xanthene, which groups
may be optionally substituted by one or more A. Preferred heteroaryl groups are five- and
six-membered polyvalent derivatives, such as the polyvalent derivatives of furan, imidazole, isothiazole,
isoxazole, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, pyrrole, pyrrolizine and
thiophene. The five-membered polyvalent derivatives are particularly preferred, i.e. the polyvalent
derivatives of furan, imidazole, isothiazole, isoxazole, pyrazole, pyrrole and thiophene. The
heteroaryl groups may be attached to the remainder of the compound by any carbon or hetero (e.g.
nitrogen) atom.

Ar¹ is preferably C6-30 arylene substituted by one or more A, preferably phenylene or naphthylene
substituted by one or more A, more preferably phenylene substituted by one or more A. When Ar¹ is
phenylene, A is preferably provided in a position ortho or para to C*. When Ar¹ is other than
phenylene, A is preferably attached to an atom which bears the charge in at least one of the resonance
structures of the ions of formula (I).

When substituted, Ar¹ is preferably substituted by 1, 2 or 3 A.

When unsubstituted, preferred Ar¹ are:

[structural formulae not reproduced]

Preferred examples of group Ar¹ are shown in figures 12A and 12B.

Combinations of Ar

Optionally two or three of the groups Ar¹ and Ar² are linked together by one or more L^, where L^ is
independently a single bond or a linker atom or group; and/or two or three of the groups Ar¹ and Ar²
together form an aromatic group or an aromatic group substituted with one or more A.

When L^ is a linker group, preferred linker groups are -E^-, -(D^)t′-, -(E^-D^)t′-, -(D^-E^)t′-,
-E^-(D^-E^)t′- or -D^-(E^-D^)t′-.

D^ is independently C1-8 hydrocarbylene or C1-8 hydrocarbylene substituted with one or more A.

E^ is independently -C(=Z^)-, -Z^C(=Z^)-, -C(=Z^)Z^-, -Z^C(=Z^)Z^-, -S(=O)-, -Z^S(=O)-,
-S(=O)Z^-, -Z^S(=O)Z^-, -S(=O)2-, -Z^S(=O)2-, -S(=O)2Z^-, -Z^S(=O)2Z^-, where Z^ is independently
O, S or N(R^) and where R^ is independently H, C1-8 hydrocarbyl or C1-8 hydrocarbyl substituted with
one or more A. Preferably E^ is -O-, -C(=O)-, -C(=O)O-, -C(=S)-, -C(=S)O-, -OC(=S)-,
-C(=O)S-, -SC(=O)-, -S(O)-, -S(O)2-, -N(R^)-, -C(=O)N(R^)-, -C(=S)N(R^)-, -N(R^)C(=O)-,
-N(R^)C(=S)-, -S(=O)N(R^)-, -N(R^)S(=O)-, -S(=O)2N(R^)-, -N(R^)S(=O)2-, -OC(=O)O-,
-SC(=O)O-, -OC(=O)S-, -N(R^)C(=O)O-, -OC(=O)N(R^)-, -N(R^)C(=O)N(R^)-, -N(R^)C(=S)N(R^)-,
-N(R^)S(=O)N(R^)- or -N(R^)S(=O)2N(R^)-.

t′ = 1 or more, e.g. from 1 to 50, 1 to 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t′ = 1, 2, 3, 4, 5, 6, 7, 8,
9 or 10. Most preferably t′ = 1.

Where L^ includes an atom or group which also falls within the definition of group M, the group M is
preferably more reactive than the group included in L^.

L^ is preferably a linker atom, preferably O or S, particularly O.

When L^ is a linker group, a preferred L^ is -N(R^)-.

In another embodiment in which L^ is a linker group, L^ is -S(=O)-.

When two of the groups Ar¹ and Ar² are linked together by one or more (e.g. 2, 3 or 4) L^, they are
preferably linked together by one L^, preferably O.

Preferred combinations of Ar are two Ar² (e.g. two Ar² phenyl groups) linked together by one L^
(e.g. O or S).

Particularly preferred combinations of Ar are two Ar² phenyl groups, optionally substituted by one or
more A (preferably unsubstituted), linked together by one L^ (e.g. O or S), where L^ is ortho to C*
with respect to both phenyl groups. Especially preferred combinations of two Ar² groups are:

[structural formulae not reproduced; surviving label: MeO]

In another embodiment, at least one LM is linked to an atom or group L^. In this embodiment, the
preferred L^ mentioned above are, where appropriate, modified to remove substituents in order to
accommodate LM, e.g. the substituent of the group -N(R^)- is replaced by LM. In this embodiment,
the group to which LM is bound is preferably:

[structural formula not reproduced; surviving labels: N, Ar¹/Ar²]

Preferred combinations of Ar¹ and/or Ar² in this embodiment are:

[structural formulae not reproduced; surviving label: OMe]

When two or three of the groups Ar¹ and Ar² together form an aromatic group or an aromatic group
substituted with one or more A, the aromatic group may be a carbocyclic aromatic group or a
carbocyclic aromatic group in which one or more carbon atoms are each replaced by a hetero atom.
Typically, in an aromatic group in which one or more carbon atoms are each replaced by a hetero
atom, up to three carbon atoms are so replaced, preferably up to two carbon atoms, more preferably one
carbon atom.

Preferred hetero atoms are O, Se, S or N, more preferably O, S or N. 

When two or three of the groups Ar¹ and Ar² together form an aromatic group or an aromatic group
substituted with one or more A, preferred aromatic groups are C6-30 aromatic groups.

The aromatic groups may be monocyclic aromatic groups (e.g. radicals of suitable valency derived
from benzene), fused polycyclic aromatic groups (e.g. radicals of suitable valency derived from
naphthalene) and unfused polycyclic aromatic groups (e.g. monocyclic or fused polycyclic aromatic
groups linked by a single bond, a double bond, or by a -(CH=CH)r- linking group, where r is one or
more (e.g. 1, 2, 3, 4 or 5)).

When two or three of the groups Ar¹ and Ar² together form a carbopolycyclic fused ring aromatic
group, preferred groups are radicals of suitable valency obtained from naphthalene, anthracene,
phenanthrene, chrysene, aceanthrylene, acenaphthylene, acephenanthrylene, azulene, fluoranthene,
fluorene, as-indacene, s-indacene, indene, phenalene and pleiadene.

When two or three of the groups Ar¹ and Ar² together form a carbopolycyclic fused ring aromatic
group in which one or more carbon atoms are each replaced by a hetero atom, preferred groups are
radicals of suitable polyvalency obtained from acridine, carbazole, β-carboline, chromene, cinnoline,
indole, indolizine, isobenzofuran, isochromene, isoindole, isoquinoline, naphthyridine, perimidine,
phenanthridine, phenanthroline, phenazine, phthalazine, pteridine, purine, pyrrolizine, quinazoline,
quinoline, quinolizine and quinoxaline.

Substitution of Ar¹ and Ar²: Anions and Cations

When C* is a cation, A is preferably an electron-donating group, including -R¹ or -Z¹R¹, where R¹
and Z¹ are defined below. Preferably, R¹ is C1-8 hydrocarbyl, more preferably C1-6 alkyl, especially
methyl. Z¹ is preferably O, S or NR¹. R¹ may be substituted with one or more Sub², but is preferably
unsubstituted. When C* is a cation, A is preferably -OMe, -SMe, -N(Me)2 or Me. When C* is a
cation, A, when an electron-donating group, is preferably provided (especially in relation to Ar¹ or
Ar² being phenyl) in a position ortho or para to C*, preferably para. Furthermore, when C* is a
cation, A, when an electron-withdrawing group (e.g. F), is preferably provided (especially in relation
to Ar¹ or Ar² being phenyl) in a position meta to C*. Thus, preferred groups Ar¹ and Ar² are as
follows:

[structural formulae not reproduced; surviving label: F]

When C* is an anion, A is preferably an electron-withdrawing group, including halogen,
trihalomethyl, -NO2, -CN, -N⁺(R¹)2O⁻, -CO2H, -CO2R¹, -SO3H, -SOR¹, -SO2R¹, -SO3R¹,
-OC(=O)OR¹, -C(=O)H, -C(=O)R¹, -OC(=O)R¹, -C(=O)NH2, -C(=O)N(R¹)2, -N(R¹)C(=O)OR¹,
-N(R¹)C(=O)N(R¹)2, -OC(=O)N(R¹)2, -N(R¹)C(=O)R¹, -C(=S)N(R¹)2, -N(R¹)C(=S)R¹, -SO2N(R¹)2,
-N(R¹)SO2R¹, -N(R¹)C(=S)N(R¹)2 or -N(R¹)SO2N(R¹)2, where R¹ is defined below. When C* is an
anion, A, when an electron-withdrawing group, is preferably provided (especially in relation to Ar¹
or Ar² being phenyl) in a position ortho or para to C*, preferably para. Furthermore, when C* is an
anion, A, when an electron-donating group, is preferably provided (especially in relation to Ar¹ or
Ar² being phenyl) in a position meta to C*.

The group A may also comprise one or more isotopes of the atoms making up group A (e.g. example
60), thus, as discussed in more detail below, allowing the masses of the compounds of the invention
to be varied. Preferred isotopes are ¹³C, ¹⁸O and ²H. When providing a series of compounds which
differ only in their masses, ¹³C and ¹⁸O are particularly preferred, as ²H atoms may cause a substantial
change in the chemical properties of the compound due to the kinetic isotope effect.

Solid Supports 

'Solid supports' for use with the invention include polymer beads, metals, resins, columns, surfaces
(including porous surfaces) and plates (e.g. mass-spectrometry plates).

The solid support is preferably one suitable for use in a mass spectrometer, such that the invention
can be conveniently accommodated into existing MS apparatus. Ionisation plates from mass
spectrometers are thus preferred solid supports, e.g. gold, glass-coated or plastic-coated plates. Solid
gold supports are particularly preferred.

Resins or columns, such as those used in affinity chromatography and the like, are particularly useful
for receiving solutions of biopolymers (purified or mixtures). For example, a cellular lysate could be
passed through such a column of formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv)
followed by cleavage of the support to leave compounds of formula (I).

Solid supports of formulae (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv) will generally
present exposed groups M capable of reacting with a biopolymer, Bp. For MS analysis, ions
preferably have a predictable mass to charge (m/e) ratio. If a biopolymer reacts with more than one
M group, however, then it will carry more than one positive charge once ionised, and its m/e ratio
will decrease. Advantageously, therefore, the groups M are arranged such that any biopolymer
molecule will covalently link with only a single group M. Consequently, each biopolymer will, on
ionisation, carry a single positive charge and thus have a predictable mass to charge ratio.
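
The arithmetic behind this point can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name and the masses are hypothetical, each attached label is taken to contribute exactly one positive charge, and electron and counter-ion masses are neglected.

```python
def mass_to_charge(biopolymer_mass, label_mass, n_labels):
    """m/e of a biopolymer carrying n_labels singly charged labels.

    Assumption: every attached label adds one positive charge; electron
    and counter-ion masses are ignored.
    """
    if n_labels < 1:
        raise ValueError("at least one label is needed to form an ion")
    total_mass = biopolymer_mass + n_labels * label_mass
    return total_mass / n_labels


# Hypothetical masses, for illustration only.
BP_MASS = 5000.0     # biopolymer
LABEL_MASS = 300.0   # one label residue

single = mass_to_charge(BP_MASS, LABEL_MASS, 1)  # one group M has reacted
double = mass_to_charge(BP_MASS, LABEL_MASS, 2)  # two groups M have reacted
print(single, double)
```

With these illustrative numbers the doubly derivatised species appears at roughly half the m/e of the singly derivatised one, which is why a single attachment per biopolymer molecule keeps the ratio predictable.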

Typically, the surface density of groups M on the solid supports of formula (IVai), (IVaii), (IVaiii),
(IVaiv), (IVbii), (IVbiii) or (IVbiv) will be chosen so that a biopolymer molecule can covalently link
with only one group M, thus preventing the formation of multiply derivatised biopolymers.

Varying the mass of compounds of the invention

Within the general formulae (I), (IIa), (IIb), (IIIa), (IIIb), (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii),
(IVbiii), (IVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv), there is much scope for
variation. There is thus much scope for variation in the mass of these compounds. In some
embodiments of the invention, it is preferred to use a series of two or more (e.g. 2, 3, 4, 5, 6 or more)
compounds with different and defined molecular masses.

The masses of the compounds of the invention can be varied via LM, Ar¹ and/or Ar². Preferably, the
masses of the compounds of the invention are varied by varying A on the groups Ar¹ and/or Ar².

In this aspect of the invention, compounds of the invention advantageously comprise one or more of F or
I as substituents A of the groups Ar¹, Ar² or Ar³. F and I each have only one naturally occurring
isotope, ¹⁹F and ¹²⁷I respectively, and thus varying the number of F and I atoms present in the
structure of the compounds can provide a series of molecular mass labels having substantially
identical shaped peaks on a mass spectrum.
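
As a rough illustration of how such a series is spaced, the following Python sketch computes label masses as hydrogens are replaced by F or I. The isotope masses are standard monoisotopic values; the function name and the base mass are hypothetical and are not taken from the specification.

```python
# Monoisotopic masses (u). 19F and 127I are the only naturally occurring
# isotopes of fluorine and iodine.
MASS_1H = 1.007825
MASS_19F = 18.998403
MASS_127I = 126.904473


def label_mass(base_mass, n_f=0, n_i=0):
    """Mass of a label after replacing n_f hydrogens by F and n_i hydrogens by I."""
    return (base_mass
            + n_f * (MASS_19F - MASS_1H)
            + n_i * (MASS_127I - MASS_1H))


BASE = 243.117  # hypothetical mass of the unsubstituted label, illustration only
series = [round(label_mass(BASE, n_f=k), 3) for k in range(4)]
print(series)
```

Each additional fluorine moves the label by about 18 units and each iodine by about 126 units, so the members of the series are well separated while each remains a single isotopic species with respect to the halogen.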

Compounds of the invention may also include one or more ²H atoms, preferably as a substituent A or
a part thereof of the groups LM, Ar¹, Ar² or Ar³ (in particular LM), in order to vary the masses of the
compounds of the invention. The compounds of the invention may include isotopes of ¹³C and ¹⁸O,
preferably as a substituent A or a part thereof of the groups LM, Ar¹, Ar² or Ar³ (in particular Ar¹,
Ar² or Ar³), in order to vary the masses of the compounds of the invention. Compounds comprising
²H, ¹³C and ¹⁸O may also be used to provide a series of molecular mass labels having substantially
identical shaped peaks on a mass spectrum, by varying the number of ²H, ¹³C and ¹⁸O atoms present in
the structure of the compounds. When providing a series of compounds which differ only in their
masses, ¹³C and ¹⁸O are particularly preferred, as ²H atoms may cause a substantial change in the
chemical properties of the compound due to the kinetic isotope effect.

In order to increase the molecular mass of the compounds of the invention and to increase the
number of available sites for substitution by A, especially F and I, one or more of Ar¹ and Ar² may
be substituted by one or more dendrimer radicals of appropriate valency, either as substituent A or
group LM.

Preferred dendrimer radicals are the radicals obtained from the dendrimers of US 6,455,071 and 
PAMAM dendrimers. 

The compounds of the invention may advantageously be used in the method of analysing a
biopolymer disclosed herein, in particular in a method for following a reaction involving a
biopolymer, Bp, since the abundance of a species may be determined by mass spectrometry by
measuring the intensity of the relevant peak in an obtained mass spectrum.

Specifically, there is provided a method for analysing biopolymer Bp, comprising the steps of:

(i) reacting a first sample comprising biopolymer Bp with a compound of formula (IIa)
or (IIb) or a solid support of formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv) at a
time t1;

(ii) reacting a second sample comprising biopolymer Bp with a compound of formula
(IIa) or (IIb) or a solid support of formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv)
at a later time t2;

(iii) preparing and analysing cations of formula (I) from the first and second samples; and

(iv) comparing the results of the analysis from step (iii).

If levels of the biopolymer Bp decrease between times t1 and t2 then there will be a decrease in
detected ion; if levels of the biopolymer Bp increase between times t1 and t2 then there will be an
increase in detected ion. The effects of stimuli on transcription and/or translation can therefore be
monitored.

Advantageously, different compounds of formula (IIa) or (IIb) or different solid supports of formula
(IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv) are used at different times in order to
facilitate simultaneous and parallel analysis of the first and second samples. For example, if the two
compounds used at times t1 and t2 differ only by a ¹H to ¹⁹F substitution then the relative abundance
of Bp at the two times can be determined by comparing peaks separated by 18 units.
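
The peak comparison in this example can be sketched as follows. This is a minimal illustration, not a prescribed procedure: the peak-list format, function names, tolerance and spectrum values are all hypothetical, and the sketch assumes that the two labelled forms ionise equally well so that the intensity ratio reflects relative abundance.

```python
# Monoisotopic masses (u), standard reference values.
MASS_1H = 1.007825
MASS_19F = 18.998403

# Mass added when one 1H of the label is replaced by 19F (about 18 units).
H_TO_F_SHIFT = MASS_19F - MASS_1H


def find_peak(peaks, target_mz, tol=0.05):
    """Intensity of the peak closest to target_mz, or None if none lies within tol."""
    candidates = [(abs(mz - target_mz), intensity)
                  for mz, intensity in peaks
                  if abs(mz - target_mz) <= tol]
    if not candidates:
        return None
    return min(candidates)[1]


def relative_abundance(peaks, mz_t1, charge=1, tol=0.05):
    """Ratio of the t2 peak (19F label) to the t1 peak (1H label).

    peaks -- list of (m/z, intensity) pairs (hypothetical data format)
    mz_t1 -- m/z of the ion carrying the label used at time t1
    """
    i_t1 = find_peak(peaks, mz_t1, tol)
    i_t2 = find_peak(peaks, mz_t1 + H_TO_F_SHIFT / charge, tol)
    if i_t1 is None or i_t2 is None or i_t1 == 0:
        raise ValueError("expected peak pair not found")
    return i_t2 / i_t1


# Hypothetical spectrum: the two labelled forms of Bp lie about 18 units apart.
spectrum = [(1500.80, 400.0), (1518.79, 200.0), (1600.00, 50.0)]
ratio = relative_abundance(spectrum, 1500.80)
print(f"shift = {H_TO_F_SHIFT:.4f}, t2/t1 = {ratio:.2f}")
```

A ratio below 1 in such a comparison would correspond to a decrease in Bp between t1 and t2, and a ratio above 1 to an increase.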

Advantageously, the reaction of the biopolymer with the compound of formula (IIa) or (IIb) or the
solid support of formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv) will fix the
biopolymer to prevent it reacting further, and the steps of providing and analysing the cations may be
carried out at a later convenient time. Alternatively, if the reaction of the biopolymer with the
compound of formula (IIa) or (IIb) or the solid support of formula (IVai), (IVaii), (IVaiii), (IVaiv),
(IVbii), (IVbiii) or (IVbiv) does not quench the reaction of the biopolymer being followed, a cation
of formula (I) from the reaction product of step (i) or step (ii) should be obtained as soon as possible
after reaction of the biopolymer with the compound of formula (IIa) or (IIb) or the solid support of
formula (IVai), (IVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (IVbiv).

Compounds of Formulae (IIa) and (IIb)

The compounds of formulae (IIa) or (IIb) are available commercially or may be synthesised by
known techniques.

Commercially available compounds of formulae (IIa) or (IIb) are disclosed, for example, in the
Molecular Probes Catalogue, 2002. Commercially available trityls, and derivatives and analogues
thereof, may also be derivatised with the groups (LM{M}p)q by known techniques.

Methods for synthesis of compounds of formula (IIa) or (IIb) useful in the present invention are
described in Chem. Soc. Rev. (2003) 32, p. 3-13, scheme 2 and "1. introduction", last two
paragraphs. Groups (LM{M}p)q are usually introduced into the intermediates and the compounds are
then assembled using the appropriate pathways. Alternatively, the groups (LM{M}p)q may be added
after assembly of the aromatic groups and α-carbon of the compounds.

Methods for synthesis of compounds of formulae (IIa) or (IIb) are also described in WO99/60007.

Further methods for synthesising the compounds of formulae (IIa) or (IIb) are described in European
patent application 04104605.3.

Preferred compounds of formula (IIa), (IIb) and (IVai) are:

[structural formulae not reproduced; surviving label: OMe]

Chemical Groups

The ions of the invention are stabilised by the resonance effect of the aromatic groups Ar¹ and Ar².
The term 'C* is a carbon atom bearing a single positive charge or a single negative charge'
therefore not only includes structures having the charge localised on the carbon atom but also
resonance structures in which the charge is delocalised from the carbon atom.

The term 'linker atom or group' includes any divalent atom or divalent group. 

The term 'aromatic group' includes quasi and/or pseudo-aromatic groups, e.g. cyclopropyl and
cyclopropylene groups.

The term 'halogen' includes fluorine, chlorine, bromine and iodine. 

The term 'hydrocarbyl' includes linear, branched or cyclic monovalent groups consisting of carbon
and hydrogen. Hydrocarbyl groups thus include alkyl, alkenyl and alkynyl groups, cycloalkyl
(including polycycloalkyl), cycloalkenyl and aryl groups and combinations thereof, e.g.
alkylcycloalkyl, alkylpolycycloalkyl, alkylaryl, alkenylaryl, cycloalkylaryl, cycloalkenylaryl,
cycloalkylalkyl, polycycloalkylalkyl, arylalkyl, arylalkenyl, arylcycloalkyl and arylcycloalkenyl
groups. Preferred hydrocarbyl are C1-14 hydrocarbyl, more preferably C1-8 hydrocarbyl.

Unless indicated explicitly otherwise, where combinations of groups are referred to herein as one
moiety, e.g. arylalkyl, the last mentioned group contains the atom by which the moiety is attached to
the rest of the molecule.

The term 'hydrocarbylene' includes linear, branched or cyclic divalent groups consisting of carbon
and hydrogen formally made by the removal of two hydrogen atoms from the same or different
(preferably different) skeletal atoms of the group. Hydrocarbylene groups thus include alkylene,
alkenylene and alkynylene groups, cycloalkylene (including polycycloalkylene), cycloalkenylene and
arylene groups and combinations thereof, e.g. alkylenecycloalkylene, alkylenepolycycloalkylene,
alkylenearylene, alkenylenearylene, cycloalkylenealkylene, polycycloalkylenealkylene,
arylenealkylene and arylenealkenylene groups. Preferred hydrocarbylene are C1-14 hydrocarbylene,
more preferably C1-8 hydrocarbylene.

The term 'hydrocarbyloxy' means hydrocarbyl-O-. 

The terms 'alkyl', 'alkylene', 'alkenyl', 'alkenylene', 'alkynyl' or 'alkynylene' are used herein to
refer to straight, cyclic and branched chain forms. Cyclic groups include C3-8 groups, preferably
C5-8 groups.

The term 'alkyl' includes monovalent saturated hydrocarbyl groups. Preferred alkyl are C1-8, more
preferably C1-4 alkyl such as methyl, ethyl, n-propyl, i-propyl or t-butyl groups.

Preferred cycloalkyl are C5-8 cycloalkyl.

The term 'alkoxy' means alkyl-O-. 

The term 'alkenyl' includes monovalent hydrocarbyl groups having at least one carbon-carbon
double bond and preferably no carbon-carbon triple bonds. Preferred alkenyl are C2-4 alkenyl.

The term 'alkynyl' includes monovalent hydrocarbyl groups having at least one carbon-carbon triple
bond and preferably no carbon-carbon double bonds. Preferred alkynyl are C2-4 alkynyl.

The term 'aryl' includes monovalent aromatic groups, such as phenyl or naphthyl. In general, the aryl
groups may be monocyclic or polycyclic fused ring aromatic groups. Preferred aryl are C6-C14 aryl.

Other examples of aryl groups are monovalent derivatives of aceanthrylene, acenaphthylene,
acephenanthrylene, anthracene, azulene, chrysene, coronene, fluoranthene, fluorene, as-indacene,
s-indacene, indene, naphthalene, ovalene, perylene, phenalene, phenanthrene, picene, pleiadene,
pyrene, pyranthrene and rubicene.

The term 'alkylene' includes divalent saturated hydrocarbylene groups. Preferred alkylene are C1-4
alkylene such as methylene, ethylene, n-propylene, i-propylene or t-butylene groups.

Preferred cycloalkylene are C5-8 cycloalkylene.

The term 'alkenylene' includes divalent hydrocarbylene groups having at least one carbon-carbon
double bond and preferably no carbon-carbon triple bonds. Preferred alkenylene are C2-4 alkenylene.

The term 'alkynylene' includes divalent hydrocarbylene groups having at least one carbon-carbon
triple bond and preferably no carbon-carbon double bonds. Preferred alkynylene are C2-4 alkynylene.

The term 'arylene' includes divalent aromatic groups, such as phenylene or naphthylene. In general, the
arylene groups may be monocyclic or polycyclic fused ring aromatic groups. Preferred arylene are
C6-C14 arylene.

Other examples of arylene groups are divalent derivatives of aceanthrylene, acenaphthylene,
acephenanthrylene, anthracene, azulene, chrysene, coronene, fluoranthene, fluorene, as-indacene,
s-indacene, indene, naphthalene, ovalene, perylene, phenalene, phenanthrene, picene, pleiadene,
pyrene, pyranthrene and rubicene.

The term 'heterohydrocarbyl' includes hydrocarbyl groups in which up to three carbon atoms,
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced
independently by O, S, Se or N, preferably O, S or N. Heterohydrocarbyl groups thus include
heteroalkyl, heteroalkenyl and heteroalkynyl groups, cycloheteroalkyl (including
polycycloheteroalkyl), cycloheteroalkenyl and heteroaryl groups and combinations thereof, e.g.
heteroalkylcycloalkyl, alkylcycloheteroalkyl, heteroalkylpolycycloalkyl, alkylpolycycloheteroalkyl,
heteroalkylaryl, alkylheteroaryl, heteroalkenylaryl, alkenylheteroaryl, cycloheteroalkylaryl,
cycloalkylheteroaryl, heterocycloalkenylaryl, cycloalkenylheteroaryl, cycloalkylheteroalkyl,
cycloheteroalkylalkyl, polycycloalkylheteroalkyl, polycycloheteroalkylalkyl, arylheteroalkyl,
heteroarylalkyl, arylheteroalkenyl, heteroarylalkenyl, arylcycloheteroalkyl, heteroarylcycloalkyl,
arylheterocycloalkenyl and heteroarylcycloalkenyl groups. The heterohydrocarbyl groups may be
attached to the remainder of the compound by any carbon or hetero (e.g. nitrogen) atom.

The term 'heterohydrocarbylene' includes hydrocarbylene groups in which up to three carbon atoms,
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced
independently by O, S, Se or N, preferably O, S or N. Heterohydrocarbylene groups thus include
heteroalkylene, heteroalkenylene and heteroalkynylene groups, cycloheteroalkylene (including
polycycloheteroalkylene), cycloheteroalkenylene and heteroarylene groups and combinations thereof,
e.g. heteroalkylenecycloalkylene, alkylenecycloheteroalkylene, heteroalkylenepolycycloalkylene,
alkylenepolycycloheteroalkylene, heteroalkylenearylene, alkyleneheteroarylene,
heteroalkenylenearylene, alkenyleneheteroarylene, cycloalkyleneheteroalkylene,
cycloheteroalkylenealkylene, polycycloalkyleneheteroalkylene, polycycloheteroalkylenealkylene,
aryleneheteroalkylene, heteroarylenealkylene, aryleneheteroalkenylene and heteroarylenealkenylene
groups. The heterohydrocarbylene groups may be attached to the remainder of the compound by any
carbon or hetero (e.g. nitrogen) atom.

Where reference is made to a carbon atom of a hydrocarbyl or other group being replaced by an O, S,
Se or N atom, what is intended is that:

-CH< is replaced by -N<;

-CH= is replaced by -N=; or

-CH2- is replaced by -O-, -S- or -Se-.

The term 'heteroalkyl' includes alkyl groups in which up to three carbon atoms, preferably up to two
carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or N,
preferably O, S or N.

The term 'heteroalkenyl' includes alkenyl groups in which up to three carbon atoms, preferably up to
two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or
N, preferably O, S or N.

The term 'heteroalkynyl' includes alkynyl groups in which up to three carbon atoms, preferably up to
two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or
N, preferably O, S or N.

The term 'heteroaryl' includes aryl groups in which up to three carbon atoms, preferably up to two
carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or N,
preferably O, S or N. Preferred heteroaryl are C5-14 heteroaryl. Examples of heteroaryl are pyridyl,
pyrrolyl, thienyl or furyl.

Other examples of heteroaryl groups are monovalent derivatives of acridine, carbazole, β-carboline,
chromene, cinnoline, furan, imidazole, indazole, indole, indolizine, isobenzofuran, isochromene,
isoindole, isoquinoline, isothiazole, isoxazole, naphthyridine, perimidine, phenanthridine,
phenanthroline, phenazine, phthalazine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine,
pyrimidine, pyrrole, pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, thiophene and
xanthene. Preferred heteroaryl groups are five- and six-membered monovalent derivatives, such as
the monovalent derivatives of furan, imidazole, isothiazole, isoxazole, pyran, pyrazine, pyrazole,
pyridazine, pyridine, pyrimidine, pyrrole, pyrrolizine and thiophene. The five-membered monovalent
derivatives are particularly preferred, i.e. the monovalent derivatives of furan, imidazole, isothiazole,
isoxazole, pyrazole, pyrrole and thiophene.

The term 'heteroalkylene' includes alkylene groups in which up to three carbon atoms, preferably up
to two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se
or N, preferably O, S or N.

The term 'heteroalkenylene' includes alkenylene groups in which up to three carbon atoms,
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced
independently by O, S, Se or N, preferably O, S or N.

The term 'heteroalkynylene' includes alkynylene groups in which up to three carbon atoms,
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced
independently by O, S, Se or N, preferably O, S or N.

The term 'heteroarylene' includes arylene groups in which up to three carbon atoms, preferably up to
two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or
N, preferably O, S or N. Preferred heteroarylene are C5-14 heteroarylene. Examples of heteroarylene
are pyridylene, pyrrolylene, thienylene or furylene.

Other examples of heteroarylene groups are divalent derivatives (where the valency is adapted to
accommodate the q instances of the linker LM) of acridine, carbazole, β-carboline, chromene,
cinnoline, furan, imidazole, indazole, indole, indolizine, isobenzofuran, isochromene, isoindole,
isoquinoline, isothiazole, isoxazole, naphthyridine, perimidine, phenanthridine, phenanthroline,
phenazine, phthalazine, purine, pyran, pyrazine, pyrazole, pyridazine, pyridine, pyrimidine, pyrrole,
pyrrolizine, quinazoline, quinoline, quinolizine, quinoxaline, thiophene and xanthene. Preferred
heteroarylene groups are five- and six-membered divalent derivatives, such as the divalent
derivatives of furan, imidazole, isothiazole, isoxazole, pyran, pyrazine, pyrazole, pyridazine,
pyridine, pyrimidine, pyrrole, pyrrolizine and thiophene. The five-membered divalent derivatives are
particularly preferred, i.e. the divalent derivatives of furan, imidazole, isothiazole, isoxazole,
pyrazole, pyrrole and thiophene.

Substitution 

A is independently a substituent, preferably a substituent Sub¹. Alternatively, A may be ²H.

Sub¹ is independently halogen, trihalomethyl, -NO2, -CN, -N⁺(R¹)2O⁻, -CO2H, -CO2R¹, -SO3H, -SOR¹,
-SO2R¹, -SO3R¹, -OC(=O)OR¹, -C(=O)H, -C(=O)R¹, -OC(=O)R¹, -N(R¹)2, -C(=O)NH2, -C(=O)N(R¹)2,
-N(R¹)C(=O)OR¹, -N(R¹)C(=O)N(R¹)2, -OC(=O)N(R¹)2, -N(R¹)C(=O)R¹, -C(=S)N(R¹)2, -N(R¹)C(=S)R¹,
-SO2N(R¹)2, -N(R¹)SO2R¹, -N(R¹)C(=S)N(R¹)2, -N(R¹)SO2N(R¹)2, -R¹ or -Z¹R¹.

Z¹ is O, S, Se or NR¹.

R¹ is independently H, C1-8 hydrocarbyl, C1-8 hydrocarbyl substituted with one or more Sub²,
C1-8 heterohydrocarbyl or C1-8 heterohydrocarbyl substituted with one or more Sub².

Sub² is independently halogen, trihalomethyl, -NO2, -CN, -N⁺(C1-6alkyl)2O⁻, -CO2H, -CO2C1-6alkyl,
-SO3H, -SOC1-6alkyl, -SO2C1-6alkyl, -SO3C1-6alkyl, -OC(=O)OC1-6alkyl, -C(=O)H, -C(=O)C1-6alkyl,
-OC(=O)C1-6alkyl, -N(C1-6alkyl)2, -C(=O)NH2, -C(=O)N(C1-6alkyl)2,
-N(C1-6alkyl)C(=O)O(C1-6alkyl), -N(C1-6alkyl)C(=O)N(C1-6alkyl)2, -OC(=O)N(C1-6alkyl)2,
-N(C1-6alkyl)C(=O)C1-6alkyl, -C(=S)N(C1-6alkyl)2, -N(C1-6alkyl)C(=S)C1-6alkyl, -SO2N(C1-6alkyl)2,
-N(C1-6alkyl)SO2C1-6alkyl, -N(C1-6alkyl)C(=S)N(C1-6alkyl)2, -N(C1-6alkyl)SO2N(C1-6alkyl)2, C1-6alkyl
or -Z¹C1-6alkyl.

Where reference is made to a substituted group, the substituents are preferably from 1 to 5 in number,
most preferably 1.

However, molecular mass labels of the invention will generally comprise 1 or more, typically
between 1 and 100 (e.g. 1 to 50, preferably 1 to 20) substituents Sub¹ or Sub², typically F or I, in order
to vary the masses of the molecular mass labels.

Preferred examples of substituent A are shown in figure 14.

Miscellaneous

A may optionally be a monovalent dendrimer radical or a monovalent dendrimer radical substituted 
with one or more substituents Sub^ 

General 

The term "comprising" means "including" as well as "consisting", e.g. a composition "comprising" X
may consist exclusively of X or may include something additional, e.g. X + Y.

The term "about" in relation to a numerical value x means, for example, x±10%.

The word "substantially" does not exclude "completely", e.g. a composition which is "substantially
free" from Y may be completely free from Y. Where necessary, the word "substantially" may be
omitted from the definition of the invention.



Tables

Table 1: C* is a cation

For formulae (IVbii) to (Vbiv) the original two-dimensional drawings are only partly recoverable; the
surviving fragments are listed as they appear.

Formula            Structure

Formula (I)        (Ar²)n-C⁺-[Ar¹-(LM{M′-Bp′}p)q]m

Formula (IIb)      (Ar²)n-C⁺-[Ar¹-(LM{M}p)q]m   X⁻

Formula (IIIb)     (Ar²)n-C⁺-[Ar¹-(LM{M′-Bp′}p)q]m   X⁻

Formula (IVbii)    Ar¹-(LM{M}p)q
                   (Ar²)n-C⁺-[Ar¹-(LM{M}p)q]m-1   X⁻

Formula (IVbiii)   (Ar²)n-1-C⁺-[Ar¹-(LM{M}p)q]m   X⁻

Formula (IVbiv)    Ar¹-(LM{M}p)q-1
                   C⁺   X⁻

Formula (Vbii)     Ar¹-(LM{M′-Bp′}p)q
                   (Ar²)n-C⁺-[Ar¹-(LM{M′-Bp′}p)q]m-1   X⁻

Formula (Vbiii)    (Ar²)n-1-C⁺-[Ar¹-(LM{M′-Bp′}p)q]m   X⁻

Formula (Vbiv)     {Bp′-M′}p-1LM{M′-Bp′}
                   Ar¹-(LM{M′-Bp′}p)q-1
                   (Ar²)n-C⁺-[Ar¹-(LM{M′-Bp′}p)q]m-1   X⁻

Table 2 — n = 2, m = 1, p = 1 and q = 1

[SS] marks the solid support symbol of the original structure drawings.

Formula            Structure

Formula (I)        (Ar2)2C*-Ar1-LM-M'-Bp'

Formula (IIa)      (Ar2)2C(X)-Ar1-LM-M

Formula (IIb)      (Ar2)2C*-Ar1-LM-M   X*

Formula (IIIa)     (Ar2)2C(X)-Ar1-LM-M'-Bp'

Formula (IIIb)     (Ar2)2C*-Ar1-LM-M'-Bp'   X*

Formula (IVai)     (Ar2)2C([SS])-Ar1-LM-M

Formula (IVaii)    (Ar2)2C(X)-Ar1([SS])-LM-M

Formula (IVaiii)   (Ar2)2C(X)-Ar1-LM-M   [solid support position not legible]

Formula (IVaiv)    (Ar2)2C(X)-Ar1   [remainder of drawing not legible]

Formula (IVbii)    (Ar2)2C*-Ar1([SS])-LM-M   X*

Formula (IVbiii)   (Ar2)2C*-Ar1-LM-M   X*   [solid support position not legible]

Formula (IVbiv)    (Ar2)2C*-[...]-LM-M-[SS]   [drawing only partly legible]

Formula (Vai)      (Ar2)2C([SS])-Ar1-LM-M'-Bp'

Formula (Vaii)     (Ar2)2C(X)-Ar1([SS])-LM-M'-Bp'

Formula (Vaiii)    (Ar2)2C(X)-Ar1-LM-M'-Bp'   [solid support position not legible]

Formula (Vaiv)     (Ar2)2C(X)-[...]-LM-M'-Bp'   [drawing only partly legible]

Formula (Vbii)     (Ar2)2C*-Ar1([SS])-LM-M'-Bp'   X*

Formula (Vbiii)    (Ar2)2C*-Ar1-LM-M'-Bp'   X*   [solid support position not legible]

Formula (Vbiv)     (Ar2)2C*-[...]-LM-M'-Bp'   [drawing only partly legible]


BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 demonstrates conceptually the effect on the mass spectrum signal of a compound of
formula (IIa) or (IIb) of the invention. Free biopolymer, such as a peptide, has poorer desorption
properties, characterised by a smaller peak on the left of the mass spectrum, whereas desorption improves
when the same molecule is conjugated to a compound of the invention.

Figure 2 shows the steps of biopolymer derivatisation with a compound of formula (IVai). The
derivatisation of a biopolymer with a compound of the invention can be carried out more conveniently
by utilising the solid phase-based format, whereby the compound is temporarily covalently attached to
a solid support. This eliminates all the separation steps associated with the homogeneous approach, as
the only additional step required is a washing step. The solid support can be a resin, a surface or a
porous surface. Alternatively, the solid support may be a mass spectrometry sample plate, which
dramatically decreases the sample preparation time. Gold-, glass- and plastic-coated plates are all
compatible with this approach.

Figure 3 shows the steps of 'reverse' biopolymer derivatisation on a covalent solid support, whereby
the release of the biopolymer derivative happens simultaneously with the derivatisation process.
The process is applicable to M groups involving leaving groups.

Figure 4 shows the steps of biopolymer derivatisation on an ionic solid support.

Figure 5 shows the steps of solid support-assisted biopolymer derivatisation. The biopolymer is
first trapped onto a solid support and then labelled with a compound of formula (IIa) or (IIb). An
advantage of this technique is that a preliminary sample enrichment occurs, since not all of the
biopolymer in the sample will stick to the solid support surface.

Figure 6 shows the mass spectrum obtained when analysing a Gly-Gly-O-acyl dipeptide conjugated
with a trityl compound of the invention.

Figure 7 shows the mass spectrum obtained when analysing a conjugate of a peptide with a trityl
compound of the invention.

Figure 8 compares the mass spectra of a BSA digest without (8A) and with (8B) labelling.

Figure 9 shows the mass spectrum obtained when analysing a mixture of trityl-labelled amines. 

Figures 10A and 10B show preferred examples of group LM.

Figures 11A and 11B show preferred examples of group M.

Figures 12A and 12B show preferred examples of groups Ar1 and Ar2.

Figure 13 shows preferred examples of groups X and X*.

Figure 14 shows preferred examples of substituent group A.

MODES FOR CARRYING OUT THE INVENTION 
Materials and Methods

The solid supports were TentaGel Macrobeads OH and NH2, 280-320 microns, Rapp Polymer.
(MA)LDI-TOF mass spectra were recorded on a PE-ABI Voyager™ Elite Reflectron Delayed
Extraction Instrument. TLC was carried out with Merck silica gel (Kieselgel 60 F254 precoated
plates and Kieselgel 60 0.040-0.063 mm). HPLC was carried out on a Waters system (Milford, MA,
USA). Phosphoramidite couplings were carried out in an ABI 394 DNA/RNA synthesiser.
Chemicals and solvents were from Sigma/Aldrich/Fluka (USA), and BDH/Merck.

Example 1 — Conjugation of a trityl tag (in solution phase) with solid support-bound biopolymer 
A 15mer poly-T oligonucleotide was synthesised on an ABI 394 DNA synthesiser using a T CPG
support according to standard protocols of phosphoramidite chemistry on a 0.2 µmol scale. After the
last coupling, an MMTr-protected 'aminolink' phosphoramidite (Glen Res., USA) was added to the
growing chain and deprotected using standard deblocker (2% DCA in DCM). The column was
removed from the synthesiser and, after a 10 min wash with acetonitrile, it was attached to two 5 ml
syringes and washed with a 0.1 M solution of NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl
for 10 min at RT. The column was then washed with acetonitrile (3 x 10 ml), placed on a DNA
synthesiser and deprotected with ammonia according to standard protocols. The residue obtained
after evaporation was dissolved in 0.1 ml of 2 M LiClO4 and precipitated from cold acetone (1.5
ml). The precipitate was washed with 0.5 ml of acetone and dried.

Example 2 — Homogenous conjugation of a trityl with non-polymeric ligands 

A solution of NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl (0.1 M) in THF/dioxane (1:1) was
mixed with a solution (0.5-1 M) of an amine or of a mixture of amines (for example, propyl amine,
butyl amine, pentyl amine, hexyl amine and phenethyl amine), typically 10 ml of a solution of an
activated trityl with 5 ml of an amine solution. The mixtures were purified on prep-TLC (2 mm-thick
glass plates with UV254 indicator, Analtech/Aldrich-Sigma), typically in chloroform with 0.5%
triethylamine. The areas containing the desired products were scratched off the plate, and the
conjugates or the mixtures thereof were eluted using the same solvent with 2-5% MeOH, filtered through
a layer of glass wool, evaporated and dried.
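
The typical quantities quoted above fix the stoichiometry of the reaction. As a minimal sketch (the function names are hypothetical; the concentrations and volumes are those stated in this example), the molar ratio of amine to activated trityl can be worked out as follows:

```python
def millimoles(conc_molar: float, volume_ml: float) -> float:
    """Amount in mmol: (mol/L) x mL = mmol."""
    return conc_molar * volume_ml


def amine_to_trityl_ratio(trityl_molar: float, trityl_ml: float,
                          amine_molar: float, amine_ml: float) -> float:
    """Molar ratio of amine to activated trityl in the mixed solution."""
    return millimoles(amine_molar, amine_ml) / millimoles(trityl_molar, trityl_ml)


if __name__ == "__main__":
    # 10 ml of 0.1 M activated trityl mixed with 5 ml of 0.5-1 M amine
    low = amine_to_trityl_ratio(0.1, 10, 0.5, 5)
    high = amine_to_trityl_ratio(0.1, 10, 1.0, 5)
    print(f"amine : trityl = {low:.1f} to {high:.1f}")
```

With these volumes the amine is therefore in a 2.5- to 5-fold molar excess over the activated trityl (1 mmol of trityl against 2.5 to 5 mmol of amine).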

Example 3 — Homogenous conjugation of an NHS-activated trityl with polymeric ligands
A peptide, an oligonucleotide, or any other biopolymer containing a (primary) amino group, is
dissolved in a mixture of water and acetonitrile depending on its solubility, typically 20-50% of
water in CH3CN. Buffers free of amino groups (e.g. 50 mM sodium phosphate, 0.15 M NaCl,
pH 7.2, or a bicarbonate buffer) can be used to keep the pH between 7 and 9, although an additional
desalting step may then need to be introduced to remove the metal ions prior to mass spectrometry. For
particularly poorly soluble ligands other solvents may be used, such as THF, DMSO, etc.

A solution of NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl in acetonitrile or THF is added in
approx. 5-10 times excess compared to the amine component. Conjugation usually reaches the
maximum yield over 2-4 hours of reaction time. The conjugate formed can be analysed by MS
directly, or after HPLC purification.

Example 4 — Conjugation of a solid phase-immobilised NHS-activated trityl tag with a ligand

A solid phase-immobilised NHS-activated trityl tag was prepared by either method 1 or method 2.

Method 1: An NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl tag was covalently attached to the
hydroxyl groups of 200 µm Rapp Polymer beads by shaking a suspension of 100 mg of the resin in
5 ml of a 0.1 M solution of the trityl chloride tag in dry pyridine at +4°C for 3 hours, then washing the
resin with pyridine and acetonitrile and drying in vacuo.

Method 2: A 5'-tritylated thymidine phosphoramidite was prepared from NHS-activated
4,4'-dimethoxy-4"-carboxyethyl trityl chloride in a standard way [M.J. Gait, Oligonucleotide Synthesis: A
Practical Approach, IRL, Oxford, 1984]. The Rapp Polymer beads (2 x 40 mg) were placed in two 1
micromol scale DNA synthesis columns (Glen Res.). The first column was coupled with the said
phosphoramidite on an ABI DNA synthesiser using manual supply of reagents (0.1 M solution of the
phosphoramidite and other standard phosphoramidite synthesis reagents) with a coupling step of 15
min. The second column was first derivatised with a trebler phosphoramidite (Glen Res.) according
to the manufacturer's protocols and then coupled with the trityl tag-containing phosphoramidite as
described for the first column. Both columns were washed extensively with acetonitrile.

The trityl loading of the solid supports produced by either method was determined
spectrophotometrically (absorbance measurements at 490 nm) to be 0.21 mmol/g for a straight
attachment and 0.39 mmol/g for a tritylation on top of the trebling synthon. (The hydroxyl group
loading of the Rapp polymer used was 0.25 mmol/g.)
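
The loading figures can be put in proportion to the hydroxyl loading of the resin. The sketch below is illustrative only: the function names are hypothetical, the per-gram values are compared directly (the mass added by the tag is ignored), and it is assumed that a trebler offers three attachment sites per resin hydroxyl, which the text does not state explicitly.

```python
OH_LOADING = 0.25       # mmol/g, hydroxyl loading of the resin (from the text)
DIRECT_LOADING = 0.21   # mmol/g, trityl loading for straight attachment
TREBLER_LOADING = 0.39  # mmol/g, trityl loading on top of the trebling synthon
TREBLER_SITES = 3       # assumption: three sites per hydroxyl after trebling


def fraction_of_sites(measured_mmol_per_g: float, sites_per_hydroxyl: int = 1) -> float:
    """Fraction of the nominally available sites that carry a trityl tag."""
    return measured_mmol_per_g / (OH_LOADING * sites_per_hydroxyl)


def reagent_excess(volume_ml: float, conc_molar: float, resin_mg: float) -> float:
    """Molar excess of tag reagent over resin hydroxyl groups (as in Method 1)."""
    reagent_mmol = volume_ml * conc_molar
    hydroxyl_mmol = (resin_mg / 1000.0) * OH_LOADING
    return reagent_mmol / hydroxyl_mmol


if __name__ == "__main__":
    print(f"direct attachment: {fraction_of_sites(DIRECT_LOADING):.0%} of hydroxyls")
    print(f"trebler: {fraction_of_sites(TREBLER_LOADING, TREBLER_SITES):.0%} of assumed sites")
    print(f"trebler vs direct: {TREBLER_LOADING / DIRECT_LOADING:.2f}-fold")
    print(f"Method 1 reagent excess: {reagent_excess(5, 0.1, 100):.0f}-fold")
```

On these assumptions the straight attachment tags about 84% of the hydroxyl groups, and the trebler raises the loading about 1.9-fold while filling roughly half of the assumed branched sites.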

To the solid support prepared as described above, a mixture of compounds to be labelled (typically
peptides) is added, typically in a mixture of 20-50% water in acetonitrile. After incubation, with
occasional shaking, for 60-120 min the resin is washed with several volumes of the same solvent, and
the conjugated products are cleaved off the resin, typically by adding 0.5-2% TFA in an appropriate
solvent. The collected sample is then analysed by MS.

Example 5 — Mass spectrometry analysis of a derivatised Gly-Gly dipeptide 

Figure 6 shows the mass spectrum obtained from a compound of the invention comprising a 
derivatised Gly-Gly-O-acyl dipeptide biopolymer. 

The ion of formula (I) containing the derivatised Gly-Gly-O-acyl biopolymer is observed at the peak
at molecular weight 516.5. There was no peak corresponding to the free dipeptide.

The fragment of formula (VI), in which the derivatised Gly-Gly-O-acyl biopolymer has been lost, is
observed at the peak at molecular weight 374.6.

Example 6 — Mass spectrometry analysis of a derivatised peptide 

Figure 7 shows the mass spectrum obtained from a compound of the invention comprising a
derivatised peptide biopolymer. The free peptide had a molecular weight of 310.

The ion of formula (I) containing the derivatised peptide biopolymer is observed at the peak at
molecular weight 665.0.

The fragment of formula (VI), in which the derivatised peptide has been lost, is observed at the peak
at molecular weight 375.0.

Significantly, there is only a very small peak at molecular weight 310, where a peak corresponding to
the free biopolymer would be found. The relative size of the peaks at 665.0 and 310 thus demonstrates
the significantly improved ionisability of the compounds of the invention compared with free
biopolymer.

Example 7 — Spectral improvement by trityls 

Three proteins (BSA, β-casein and ADH) were digested with trypsin and the resulting peptides were
analysed by MALDI-TOF mass spectrometry with or without derivatisation. The number of peptides
identified for each protein is shown below. The theoretical total number of peptides that would be
produced by trypsin digestion of each protein was calculated in silico and is shown in the second
column of the table below.



Protein     Number of      Total number of peptides identified   MASCOT search score*
            theoretical
            peptides†      Underivatised      Derivatised        Underivatised      Derivatised

BSA         144            14 (10%)           41 (28%)           132                126

β-casein    27             4 (15%)            13 (48%)           no match           123

ADH         60             7 (12%)            18 (30%)           77                 111



† The number of theoretical peptides for each protein was generated assuming one
missed cleavage and disregarding di- and mono-amino acids generated.

* Score is -10*Log(P), where P is the probability that the observed match is a
random event. Protein scores greater than 63 are significant (p<0.05).
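
The in silico digest behind the theoretical peptide counts can be sketched as follows. This is a minimal illustration and not the procedure actually used: it assumes the common rule that trypsin cleaves C-terminal to K or R except before P (the text does not specify the rule), it counts peptides without removing duplicates, and both the function name and the test sequence are hypothetical. It requires Python 3.7 or later, which allows splitting on zero-width matches.

```python
import re


def tryptic_peptides(sequence: str, missed_cleavages: int = 1, min_length: int = 3) -> list:
    """List tryptic peptides with up to `missed_cleavages` missed sites.

    Peptides shorter than `min_length` (mono- and di-amino acids by default)
    are disregarded.
    """
    # Assumed rule: cleave after K or R unless the next residue is P.
    fragments = [f for f in re.split(r"(?<=[KR])(?!P)", sequence) if f]
    peptides = []
    for start in range(len(fragments)):
        last = min(start + missed_cleavages, len(fragments) - 1)
        for end in range(start, last + 1):
            peptide = "".join(fragments[start:end + 1])
            if len(peptide) >= min_length:
                peptides.append(peptide)
    return peptides


if __name__ == "__main__":
    demo = "AAAKGGGRPCCKDE"  # hypothetical sequence, illustration only
    for peptide in tryptic_peptides(demo):
        print(peptide)
```

For the hypothetical sequence the R-P bond is left intact, the dipeptide DE is discarded, and four peptides remain once one missed cleavage is allowed.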

Derivatisation of peptides with trityl groups of the invention thus improves detection, as a
significantly larger number of peptides was detected for each of the three proteins when
derivatisation was used. Furthermore, protein identification by mass fingerprinting can be improved.
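
The percentages in the table, and the size of the improvement, follow directly from the peptide counts. A minimal sketch (the names are hypothetical; the counts are those reported in the table):

```python
# (theoretical, underivatised, derivatised) peptide counts from the table
COUNTS = {
    "BSA": (144, 14, 41),
    "casein": (27, 4, 13),
    "ADH": (60, 7, 18),
}


def summarise(theoretical: int, underivatised: int, derivatised: int) -> dict:
    """Percent of theoretical peptides identified, and the fold increase."""
    return {
        "underivatised_pct": round(100 * underivatised / theoretical),
        "derivatised_pct": round(100 * derivatised / theoretical),
        "fold_increase": round(derivatised / underivatised, 2),
    }


if __name__ == "__main__":
    for protein, counts in COUNTS.items():
        print(protein, summarise(*counts))
```

The recomputed percentages agree with those in the table, and the number of identified peptides rises between about 2.6-fold (ADH) and 3.25-fold (casein).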

Taking β-casein as an example, the number of detectable fragments more than tripled, and the
derivatised spectrum allowed a MASCOT-based identification which was not previously possible.
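
The score definition given in the table footnote, -10*Log(P), can be inverted to recover P from a reported score. The sketch below covers only that formula, taking Log as the base-10 logarithm; it does not reproduce how the significance threshold of 63 is derived, and the function names are hypothetical.

```python
import math


def score_to_probability(score: float) -> float:
    """Invert score = -10 * log10(P)."""
    return 10 ** (-score / 10)


def probability_to_score(probability: float) -> float:
    """Score = -10 * log10(P)."""
    return -10 * math.log10(probability)


if __name__ == "__main__":
    # Scores reported in the table of Example 7
    for label, score in [("BSA underivatised", 132), ("BSA derivatised", 126),
                         ("casein derivatised", 123), ("ADH underivatised", 77),
                         ("ADH derivatised", 111)]:
        print(f"{label}: score {score} -> P = {score_to_probability(score):.1e}")
```

Because the scale is logarithmic, every 10 points of score corresponds to a tenfold change in P.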

Example 8 — BSA fragmentation and mass spectrometry 
Bovine serum albumin (BSA) was digested with trypsin and analysed by MALDI-TOF. The resulting
spectrum is shown in Figure 8A. The experiment was repeated, but the peptide mixture was labelled
with a dimethoxytrityl label after trypsin digestion. The spectrum in Figure 8B shows the dramatic
increase in visible ions due to the trityl label. Four specific peptides have been highlighted in both
spectra.

Example 9 — Mass spectrometry of amines 
A solution of NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl (0.1 M) in THF/dioxane (1:1) was
mixed with a solution (0.5-1 M) of an amine or of a mixture of amines (for example, propyl amine,
butyl amine, pentyl amine, hexyl amine and phenethyl amine), typically 10 ml of a solution of an
activated trityl with 5 ml of an amine solution. The mixtures were purified on prep-TLC (2 mm-thick
glass plates with UV254 indicator, Analtech/Aldrich-Sigma), typically in chloroform with 0.5%
triethylamine. The areas containing the desired products were scratched off the plate, and the
conjugates or the mixtures thereof were eluted using the same solvent with 2-5% MeOH, filtered through
a layer of glass wool, evaporated and dried. Figure 9 shows a spectrum obtained in this way.

It will be understood that the invention is described above by way of example only and modifications 
may be made whilst remaining within the scope and spirit of the invention. 